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PREFACE 


In this exposition I hope to illustrate certain basic notions 
and methods of probability theory by unraveling them in a 
simple and fruitful case: that of the Brownian Motion Process 
applied to a pair of famous problems in Electricity. One is 
the boundary value problem initiated by Green which begins 
this book, and the other is the equilibrium problem formerly 
associated with an energy principle, which concludes this book. 
The first problem is solved by the distribution of the Brownian 
paths at their first exit from Green’s conductor, and the second 
problem by that of the same paths at their last exit there- 
from. To adopt a classic Chinese rhetorical turn: “if one could 
resurrect and ask him” [4% #&a¢ f@ )2] what would Green 
say? 

The book is short enough not to require a summary or 
abstract. The undecided reader can easily leaf through the 
hundred generously spaced, well printed pages to see if he/she 
is “willing and able”. To quote Emerson: ““Tis the good reader 
that makes the good book”. In my writing I have taken a 
generally inquisitive, tutorial stance, interposed some amusing 
history and personal reminiscences, indulged in a few cavils 
and slangs, frequently taken the reader into my confidence, 
with quizzing and teasing. Here are some vista points not 
to be missed: my inspiration comes from the Eulerian inte- 
gral over time of the DeMoivre—Laplace—Gauss normal density 


vi Preface 


that yields the Newton—Coulomb potential, exhibited in (2.3). 
The thread of probabilistic thinking unrolls from the spon- 
taneous splitting of the time-arrow exhibited in (3.4). The 
great random march starts with the “post-option-temporal- 
homogeneity” announced in (4.11). From this it is really a 
short step, barring formalities, to Green’s electrical discoveries 
embodied in (8.9) and (8.11). 

Although the contents had been maturing in my mind over 
years, I did the final writing in four months beginning near 
the end of the year 1994. Even over such a period of time my 
mood was subject to fluctuations which must have affected the 
selection of material as well as its presentation. Suffice it to 
tell that for each Note preserved in the final version another 
probably has been discarded, and no claim is made to any 
uniform standard of relevance or circumspection. I have also 
been quite casual about the references, not all of which are 
fully documented or listed. As an innovation I have appended 
six photocopies of original writings, first pages of books and 
articles, at the end of the volume, not as museum pieces but 
as moments of recaptured time. 

Portions of the book have been rehearsed in various 
audiences, in: Mexico (1991), Dalian, China (1991), Lawrence, 
Kansas (1992), Columbia, Missouri (1992), Auburn, Alabama 
(1995), Milan, Italy (1995). It is a pleasure to take this 
opportunity to thank my hostess and hosts on those mem- 
orable occasions: Maria Emilia Caballero, Diego Hernandez 
(deceased), Litze Xu, Tyrone Duncan, Zhongxin Zhao, Olav 
Kallenberg, Bruno Bassan. 

World Scientific Publishing Company is publishing my 
Chinese book “A New Introduction to Stochastic Processes” , 
(written with the assistance of Rong Wu). Its undertaking 
to publish this somewhat unconventional “new venture”, with 
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the tremendous efficiency and splendid cooperation of its staff, 
deserves particular recognition and appreciation not only on 
my part, but also, let us hope, by prospective readers of the 
World. 


Kai Lai Chung 
July 3, 1995 
Stanford, California 


In the new edition I have added the important Addenda 
and inserted some titbits of new information at various places. 
These were acquired through correspondence with acquain- 
tances or from welcome strangers. The names of contributors 
are given without titles, as before. In March 2000 I received 
a copy of “George Green” (The Athlone Press, 1995) from the 
authoress D. M. Cannell, accompanied by a letter. She an- 
swered my “speculation as to whether Green and Brown ever 
met”, saying that she “would be prepared to say (if not to 
publish!) that they never did”. I then wrote to ask her if there 
exists a photo or painting of George Green, since he does not 
appear in the twenty plates reproduced in her book, including 
Green’s family mill which has been restored. Unfortunately 
she died shortly after and it was Lawrie Challis who wrote to 
say “no”. Such a pity. I am indebted to Robert E. O’Malley 
Jr. who mentioned my book to Green’s biographer. 


March 17, 2001 
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SOME NOTATION 
AND TERMINOLOGY 


is Euclidean space of dimension d. 

= [0, 0). 

is the class of Borel sets in R¢, called the Borel tribe. 
is the class of bounded Borel measurable functions. 


is the class of Borel measurable functions which vanishes 
outside some unspecified compact set; it is also said to 
have compact support. 


is the class of continuous function on the set D; when 
(D) is omitted (R*) is meant. 


are defined like B,, B. and usually come with (D) as in 
C(D). 


is the subclass of C' which are twice continuously differ- 
entiable in the open set D; you can guess what C” and 
C®™ mean even with the subscript c adjoined, but watch 
out if C? is used it could mean different things. 


is the Borel-Lebesgue measure in any R¢, dm is used as 
the differential but dz or dy is also used. 


is used for the often omitted sample point in the sample 
space 22. 
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BMP is about the only acronym used here and reads Brownian 
Motion Process, which is also referred to without the 
acronym, and often without the word Process; the word 
Movement is also used instead of Motion. 

“Almost surely” or “a.s.” means “with probability one”; more 
particularly, “P*-a.s.” restricts the probability to P*. 


1. GREEN’S IDEAS 


In order to tell the story a little in its historical perspective, 
I shall begin with George Green’s original ideas, published in 
1828.1 

A formula is customarily taught in a second year calculus 
course, known as Green’s Second Identity: 


Ov Ou 
[ude — vAu)dm = / (ue — 5) do . (1.1) 
D aD 


Here D is a “region” in R°, OD its bounding surface, dm 
(usually written as dx) is the volume element, do is the area 
element, u and v are two functions, A (sometimes written as 
V7) is the Laplacian, and Zz is the outward normal derivative. 
In an elementary course these terms are introduced casually; 
for instance what exactly is a “region”? We shall not bother 
with tedious formalities until later. As a matter of fact, a 
rigorous demonstration of the identity under appropriate con- 
ditions cannot be attempted in a sophomore or even a graduate 
class. 

Let us begin with D. We assume it to be an open, nonempty 
(how stupid to have to say that!) and connected set, and call 
such a set a “domain” from here on, abandoning the often 
ill-defined term “region”. 

A function v is called “harmonic” in D iff v € C?(D), and 


satisfies the Laplace equation in D: 
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Av = 0, As (1.2) 


Without further ado, we will apply (1.1) with an arbitrary 
harmonic function v and a very special function u. The latter 
carries a “parameter” and will be denoted by uz(-) or u(z,-): 


1 

ue(y) = ua) = (1.3) 
You may recognize this as Newton’s gravitational potential, as 
well as Coulomb’s electrostatic potential. Apart from a nu- 
merical constant, of the utmost importance in physics (and 
consequently for the real world), but quite ignorable in math- 
ematics, u(z,y) represents the potential induced by a mass or 
charge placed at the point x and evaluated at the point y. For 
each x, we have (A, is A with y as variable) 


Aytzs(y)=0  ifyAz; (1.4) 


namely uz is harmonic in R*\{z}. 

You should verify (1.4) as an exercise although it is a dull 
one. You may have already done so in your calculus course, 
and now you will find out what it is all about. Owing to 
the little exception indicated in (1.4), Green’s identity (1.1) is 
NOT applicable with u, as the u there when the domain D 
contains the point z, as we do assume. We can get around 
the difficulty by surrounding x with a small ball B = B(z,r), 
such that B Cc D, and apply (1.1) to the domain D\B.? By 
the way, from now on we will assume D to be bounded. Since 
Ayv = Ayuz = 0 in D\B, and 0(D\B) = (OD) U (OB), (1.1) 


reduces to 


Ov Our 1 Ov 1 
oD OB 
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because on 0B, we have u = 4 and ou = —4. The last integral 
can be evaluated by the mean-value theorem (of integrals!) as 


(ow) = =v") Amr? 
where y’ and y” are two points on OB, and 4rr? is the area 
of 0B. (Not many American students know or care to know 
the area of a sphere or the volume of a ball; but I hope you 
do.] Since v is harmonic in D, both it and &% are bounded on 
OB as r decreases to zero, while u(y”) converges to v(x). Thus 
when we shrink B to {x} we obtain from (1.5): 


—Arv(x) = / (ee ~ ex | do, ceD. (1.6) 
dD 


This relation holds for any harmonic v and any x in D. As 
Green observed, it represents v by its boundary values together 
with those of gu But the latter is a nuisance and must be got 
rid of. How? Here is his bright idea. Apply (1.1) once more 
with the same v but another truly harmonic h, for u, so that 
this time there is no trouble and (1.1) yields at once 


Ohy Ov 
o= f (Fev— hep) do (1.7) 


aD 


Now subtract (1.7) from (1.6): 


Arve) = f (A ely — (a — a) 5 do. (1.8) 


aD 


In this formula h,(-) is any function harmonic in D. If we can 
choose it so that it be equal to u,(-) on OD, then the unwelcome 
term involving 2% in (1.8) will be knocked off, and the result 


will be a representation of v by its boundary values, as desired: 
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v(t) = f 5 (ue ha) vd (1.9) 
aD 

The problem of finding such an h, became known as the 
“first boundary value problem” of Dirichlet’s. But for Green 
it was “no problem” (pas de probléme) at all. Why, the solu- 
tion is given by NATURE: hz is nothing but the potential of 
the charge induced on the grounded surface by a unit charge 
at x; provided that you understand this lingo of electromag- 
netism! Here I should like to quote from Oliver Dimon Kellogg 
([pp. 237-238)): 


“We have here an excellent example of the value and 
danger of intuitional reasoning. On the credit side is 
the fact that it led Green to a series of important dis- 
coveries since well established. On the debit side is its 
unreliability, for there are, in fact, regions for which 
Green’s function does not exist.” 


We shall solve the boundary value problem by probabilistic 
method. For the moment suppose the requisite h, has been 
found, and suppose that it has a continuous normal derivative 
on OD as appearing in (1.9) — something Green probably never 
worried about but Kellogg did work on. Then it is clear that 
the problem is equivalent to the determination or construction 
of the function 


(x,y) 5 Uz(y) aa ha(y) . (1.10) 


This function of two variables is now called Green’s function. 
apart from a numerical factor which still varies from author to 
author. Many books with it in the title have been written. 


2. PROBABILITY 
AND POTENTIAL 


In the scenario of the preceding discussion, there is only 
space and no time. But it is generally perceived that TIME is 
the mysterious element of our universe. Consider the function 
of three variables which should be familiar if you have studied 
any probability or statistics: 


1 = e~y|? 
Pi 2,y)=—aapme * (2.1) 


(Qrt)4/2 
where t € (0,00),  € R4, y € R4,d>1. For fixed z, p(t; z,-) 
is the normal density function with mean x and variance t. 
If we regard the parameter ¢ as time, it becomes natural to 
integrate “over time”. For d = 1 and d = 2, the result is the 
disappointing +oo. For d > 3, however, we get the fascinating 


Co 


r(g-1 1 
J vtese. wat = OO a . (2.2) 
0 
You should carry out the “definite integral” above without re- 
course to a table or handbook, thereby learning Euler’s in- 
tegral for his gamma function I. vou should also find out 
the value of Is ) which will yield aes eT for the right-hand 
member of (2. 2) when d = 3. From now on we take d = 3, 
although for all d > 3 the results are quite similar except 


9 


10 Green, Brown, and Probability 


for certain constants depending on d. The case d = 1 can 
be treated by easier methods (what is a harmonic function in 
R'?). The case d = 2 is unfortunately more difficult and must 
be left out in spite of its appeal and import. 

Our first recognition is that integrating p(t;z,y) over all 
time ¢ leads us to the Newtonian potential (1.3) apart from a 
numerical factor. Let us introduce a new notation to absorb 
the latter: 


, 1 1 
0 


At the same time, we introduce an operator G as follows: for 
suitable functions /: 


Gra) = f ale. flwday = = ~ | (2.4) 


|x — wi 
where the integral is taken over all R?, as also in similar cases 
below. We are using f as a dummy variable in (2.4) in the 
sense that G maps f into Gf. It is usually sufficient to use 
a class of functions (called “test functions”) to determine the 
operator G; for example the class C,(R?). But if you are not 
familiar with such doings you should do two exercises in this 
connection: 


(i) if f € B, then Gf € Cy; 
(ii) if f = 1 then G1 = oo; hence the class Cy is not suitable 
for (2.4)! 


Having introduced G we might as well define the operator P; 
associated with p(t; -,-): 


Pef (a) = : p(t, y) f(u)dy « (2.5) 
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In view of (2.3), we have then symbolically: 


CO 


ar= frp o a= | Pat. (2.6) 
0 


0 


Let me take this opportunity to review an indispensable tool 
of all analysis: the theorem of Fubini and Tonelli. For a given 
f, the validity of (2.6) amounts to reversing the order of two 
integrations in 


ape) =f ( f vsx,v)ae] seday 
R3 0 
=f { freer |a=f area. 
0 \R3 0 (2.7) 


If f > 0, then according to Tonelli’s supplement to his teacher 
Fubini’s theorem, such a change of order of integrations is al- 
ways OK, whether the result of the double integration be finite 
or infinite. Thus we have 


IF I(y) 


an J |z—yl 


Gill) = 5, 


ay = f Piif\(e)at < +00 . (2.8) 


Now if the result above is “< oo”, then Fubini’s theorem as- 
serts that we can replace |f| everywhere in (2.8) by f and the 
equations will also hold. This is the kind of application we shall 
make again and again, in diverse similar circumstances. Allow 
me to call attention to a common faux-pas: we must check 


J Palfi(aat < oo in the step above, not f (P.f\(a)dt < oo! 


This sort of mistake has been made by eeactisine mathemati- 
cians in a hurry, beware. 
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The relation (2.3) furnishes a vital link between two big 
things. At one end we see the Newton—Coulomb potential; 
at the other end the normal density that is also known as 
the DeMoivre-Laplace-Gauss distribution, (alias the “error 
function”). Can the linkage be a mere accident, or does it 
portend something of innate significance?? 

I do not know if the clue hidden in (2.3) bridging the 
random and the deterministic had been noticed by earlier 
mathematicians and physicists (they were the same group for- 
merly). Oddly enough, there was an intervening development, 
from botany. 

Robert Brown was a contemporary of George Green, 
though he was a Scot, not an Englishman. He was a noted 
botanist who made his microscopic observation of plant-pollen 
movements in 1827, published in 1828, the same year of Green’s 
Essay. (Did the two ever meet?) Tiny particles suspended 
in liquid move about in a highly irregular fashion. Naturally 
their paths have got to be continuous, but they were seen 
apparently to be so irregular that they were believed to be 
non-differentiable by the physicist Jean Perrin, who had heard 
about Weierstrass’s discovery of such pathological functions. 
[I learned this first around 1937 from Titchmarsh’s Theory 
of Functions.} But it was Albert Einstein* who first used a 
probability model to study the random movements of those 
pollen particles that became known as Brownian motion.® 


Hinstein’s original papers on the subject are now available 
in English. Here is a slightly edited version of the relevant 
portion. (see [Einstein; pp. 14-15]). In a short time interval 7 
the relative proportion of particles which experience a displace- 
ment between 6 and 6+dé6 can be expressed by probability law 
(6) such that 
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| o(5)d5 = 1; i: 5:0(8)d6 = 0; 
; / 5(6)d5 = Dr , (2.9) 


where D is the coefficient of diffusion. Put f(x,t) to be the 
number of particles per unit volume at location x and time t. 
Then on the one hand we have 


fla,t+7) = flat) + Sor 


and on the other hand we have 


Hg if Fla + 6,t)0(8)d5 


—oo 


= f [ten + Fho4 ote (5) d5 


in which the higher infinitesimals have been ignored. Hence by 
confronting the two approximations, one with respect to time 
and the other with respect to (randomized) displacements, he 
obtained the partial differential equation: 


Of _ oO’ f 
Be oe Gay) 
from which he concluded that 
f(a,t)= Lem (2.11) 


V4nr Dt 


where n is the total number of particles. 
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In his argument it was assumed that the movements of the 
single particles are mutually independent. This is standard in 
statistical mechanics, formerly called “gas theory”. His basic 
assumptions for the movement of the particles will be quoted 
and discussed later in Sec. 5. Let us also observe the crucial 
assumption contained in the third equation in (2.9) that makes 
(2.10) possible. We shall return to this with hindsight. For the 
moment we note that if D = 4 then the f(z, t) in (2.11) reduces 
to the p(t;0,z) in (2.1) with d= 1. It sufficed for Einstein to 
treat one of the three coordinates, of course. The extension of 
(2.10) to R? is known as the heat equation: 


of 1 
Be = aM: (2.12) 


It is a good exercise to verify that for each z, p(t; x, -) satisfies 
this equation. Remembering Eqs. (1.4) and (2.3), it should 
arouse our curiosity if we can use the (2.12) for p(t;z,-) to 
confirm (1.4)? Indeed we can, as follows: 


lo. <) 


Ayu(z,y) = Ay iy p(t; x, y)dt 


= i Ayp(t; 2, y)at = / 2< w(t x, y)dt 
0 


= 2p(0o; x,y) — 2p(0+;z,y) =0-0=0 
(2.13) 


provided x # y! This may be heuristic but since it comes out 

OK, it must be justifiable. It is most satisfying to see that we 

need x £ y to justify lim p(t; x,y) = 0 in the final step above. 
> 


3. PROCESS 


We are going to set up a probabilistic model known as the 
Brownian Motion Process, to be abbreviated to BMP in later 
references. A stochastic process is a parametrized collection 
of random variables on a probability space (Q,7,P). The 
parameter will be called “time” and denoted by t. In the case 
in question, ¢t € [0,0o) = Ro so that time has a beginning but 
no end. The process is then denoted by {X;} with the range 
of ¢ understood. For each t, the random variable X; is the 
function w + X;(w) from 2 to a state space which will be R%, 
usually with d = 3 as in previous discussions. 

It is my intention to formulate the hypotheses for the BMP 
in several stages, as they are needed in the gradual unraveling 
of the basic properties of the process. The first hypothesis is 
based on Einstein’s calculations. 


For each t > 0, 
X;—Xo has the probability density p(t;0,-) . (3.1) 


The distribution of Xo, (viz. the initial distribution of the 
process) is not specified. If it is the point mass €,, namely if 
Xo = x, then X; has the density p(t;x,-). Probabilities and 
mathematical expectations under the assumption Xo = a will 
be denoted by P® and E* respectively. Thus P*{X; € A} = 


f p(t; x, y)dy. 
A 
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Once we have a process we can make up new random 
variables. For each w, consider the time integral: 


Rice (X;(w))d wel. (3.2) 


For a suitable f this will exist, and the resulting function 

of w will be a random variable, namely #-measurable. But 

does it signify anything? When f is the indicator function 
co 


1g of a Borel set B, the integral f{ 1g(X;z)dt represents the 
0 


total “occupation time” of B, an interesting quantity in certain 
problems. Let us calculate the mean of the random variable in 
(3.2). Using Fubini—Tonelli, we obtain 


| sexoee - [etna 


7 / P.f(x)dt=Gf(e). (3.3) 


Thus the operator G introduced in Sec. 2 can be endowed with 
a probability meaning. Next, take any time T and split the 
integral in (3.2) as follows: 


T co 
f(Xiat = i f(Xz)at-+ f FoCat 
0 oe. 


oO, 9 


f(Xry+)dt . (3.4) 


l| 
Om by 
ea 
i 
& 
+ 


Process 17 


To exploit this seemingly innocuous relation to great advan- 
tage, we will coin a new name and a new notation. “Notation 
can be incredibly important!” (see [R]). Denoting X74: by 
X7, we re-invent a new process {X/}, t € Ro, to be called 
the “post-T process”. For T = 0, it is the original {X;}. 
The distribution of the initial random variable Xf = Xr is 
determined by that of Xo, by Hypothesis (3.1), as follows: 


P*{XT € A} = Pr(e, A) = / p(T;a,y)dy (3.8) 
A 


for any Borel set A. More conveniently, we have for suitable 
f, for example f € C;: 


E*{f(Xo)} = Prf(2) . (3.6) 


From this we derive that if Xo has the distribution yz, then the 
distribution of Xr will be f u(dx)Pr(z,-). 
Now we state the second fundamental hypothesis for {X¢}. 


For each T > 0, and t > 0, x - Bae 
has the probability density p(t; 0,-) . (3.7) 


This includes (3.1) and the “temporal homogeneity” of the 
process, namely for each T’' > 0, the post-T’ process obeys 
the same probability law as the original process, except in 
its initial distribution. Thus the hypothesis is a natural one 
for botany or physics, in which the results of observation or 
experimentation should not depend on the specific time when 
it is carried out, “provided that the temperature, pressure, 
humidity, ... , remain constant”. 
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Thus, the distribution of [ f(X,)dt under “Xo = y” is the 
0 


same as that of f f(X/)dt under “XJ = y”. Let Xo = 2, 
0 
then by (3.5), the distribution of XP is Pr(z,-). By (3.7), 
foe) 
the distribution of f f(X/)dt must be the same as that of 
0 


J f(X4)dt when Xo has the distribution Pr(z,-). Therefore 
0 


we have 


| [rade - [rrinaner} [xa (3.8) 


In view of (3.3), the right member in (3.8) may be written as 
PrGf(z) = E*{Gf(Xr)} . 


Pushing symbolism further, we can record (3.8) using only 
{X¢} as follows: 


E* | f(Xr4s)dt } = E® < EX? | f(X)dt| > . (3.9) 


This is the form of the result to be found in standard texts. It 
is an abuse of notation to be tolerated: when 7’ = t the two 
appearances of X; denote quite different things! 

We now return to (3.4) and take E® to obtain 


T 
Gf (2) = E* / jx + PGi). (3.10) 


Frankly, so far the result can also be derived from a little 
operator theory, without the intervention of the process {X;}. 
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For we have Pri: = PrP,, and consequently 
co T lore) 
G= [pat = [Pet + [Pere 
0 0 0 
T oo L 
= [ Pats Pp [ Pat= | Pats Pr, 
0 0 (3.11) 


which is (3.10) in a symbolic form. So, what’s the big deal? 
It is the next step that will separate the process {X;} from 


its transitional operator {P;}, “the men from the boys”, a 


nd 


show the true worth of probabilistic methods beyond the reach 
of mere operators. To emphasize this I will begin a new section. 
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4. RANDOM TIME 


A stochastic process is much more than its transition 
semigroup. The random variables { X;} generate other random 
variables that have no counter parts in the other theory. To 
recognize this it is necessary to describe the process in a more 
formal way as a function of two variables: 


(w,t) + X(w,t), wEN, tER; (4.1) 


where w is the dummy variable in the “sample space” (2, while 
t is the time as before . To each sample w from 2 corresponds 
the function t + X (w, t), namely the “sample function” X (w, -) 
from Ro to R3. The latter is also called a path or trajectory. 
Thus, instead of viewing the process as a collection of random 
variables indicated by the notation {X;} indexed by t and with 
w omitted, we can view it as a universe or collection of sample 
functions {X(w,-)} indexed by w and with t indicated by a 
dot; or more explicitly as in (4.1) with X;(w) = X(w,t). 

Now when one has two variables x and y as in planar 
geometry one often needs to substitute a function, say y, of 
x for y to get (x, y(x)) which represents a curve: x — y(z). 
So we need to substitute a function T of w for t to get from 
(w,t) the random time (w, T(w)) or w > T(w). 

Let us introduce at once two plenipotent random times 
which will be our main topics in the sequel. For a Borel set A, 
we define: 


21 


22 Green, Brown, and Probability 


Ta(w) =inf{0<t<oo: X(w,t)e A}, (4.2) 
La(w)=sup{0<t<oo: X(w,t)e€ A}. (4.3) 


Of course the time-sets between the braces above may be 
empty, in which case the standard convention applies: 


inf @ = +00, sup? =0. 


Obviously T',4 is the “first entrance time” in A, but owing to 
the tricky infimum in its definition, it may be better to call 
it the “hitting time” of A. As for D4, the natural name is 
the “ last exit time” from A, but I have also used the trickier 
“quitting time” to match the hitting time. 

The two random times T4 and L, have fundamentally 
different characters. The former is often called a “stopping 
time”, but is better called “optional”. To explain optionality it 
is necessary to bring forth the Borel tribe (o-field) generated by 
the random variables of the process. For each t, we denote by 
F, the Borel tribe generated by {X,, 0 < s < t}; for technical 
reasons we may have to augment this by “null sets”, a tedious 
affair not fit for casual discussion. Then a random variable 
w —> T(w) is called optional iff for each t we have {T < t} € F;. 
This is described by saying that we can determine whether the 
random TJ occurs before ¢ or not by our information on the 
process up to time ¢, nothing after ft. Even more intuitively, 
we say that we can choose (opt for) T without clairvoyance. 
This is actually the language borrowed from a gambling sys- 
tem which inspired J. L. Doob to study such random times in 
his martingale theory, circa 1940. Apparently the physicists 
had used the term “first passage time” for T,4, but they did 
not bother to define mathematical terms since they knew what 
they were talking about. Now let us take a nice set A and see 
if we can check that 
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{Ta <theF,. (4.4) 


This is an instance of the clash between intuition and logic! 
The Borel tribes are rigidly defined purely mathematical 
notions, they do not submit facilely to intuitive handling. Even 
for a simple case as when A is the complement of an open 
ball, (4.4) is not true without a rather strong assumption on 
the process {X;} which will be stated shortly. Nevertheless, 
G. A. Hunt has proved that (4.4) is true for a class of sets 
A including all Borel sets, even the analytic or Souslin sets 
(the augmentation of F; mentioned above is needed), for a 
large class of stochastic processes now named after him: Hunt 
process. It is time to state the next fundamental hypothesis 
for our BMP. 


For each w € 2, the sample function X (w, -) 


is a continuous function on Ry to R? . (4.5) 


This is the continuity of the Brownian paths taken for granted 
by Perrin and others (I do not know if Brown ever asserted 
it). It was first proved by Norbert Wiener [D] and it depends 
heavily on the normal distribution in (2.1) as well as the 
independence assumed by Einstein in his investigation (which 
is not yet incorporated in the present unraveling). His proof 
was difficult, as many first proofs were. Paul Lévy who 
regretted that he missed being first,° gave an easier proof which 
can be read in my recent book [F]. A proof derived from more 
general considerations can be found in [L]. Now let us see how 
to show (4.4). We have 


{T4 <t}={dse€(0,t): X,eA}. (4.6) 


Warning: this too must be proved. If we change the “<” 
on the left side to “<”, and simultaneously the “(0,t)” to 
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“(0,t]” on the right side, the result becomes false (for an open 
A)! Now isn’t it “intuitively clear” that the right-hand set in 
(4.6) belongs to F;? We need to be formal and write this set 
logically as |) {Xs € A}. Of course for each s in (0,¢), 
s€(0,t 

the set {X, : 24 belongs to F; but the union is over an 
uncountable collection and nothing in the definition of the 
Borel tribe F; says such a union must be in it! Tribes are 
creatures of countability. So let us take a countable subset of 
(0,t), for example the rationals denoted by Q. Can we show 
that 

U {Xe 4}= [ {X.€ 4}? (4.7) 


s€(0,t) s€(0,t)N@ 


Here some kind of continuity of X, in s will help, but we must 
still subject the set A to a topological condition in order to 
utilize it. If A is an open set and X(s) is continuous in s, then 
X, € A implies X, € A for rational r sufficiently close to s; it 
follows that (4.7) is true. The argument shows that one-sided 
continuity of s > X, is sufficient. Indeed, it is sufficient that 
the sample functions X(w,-) be all “separable relative to Q”, 
where @ may be any countable set that is dense in Ro. This 
“separability” is a mini kind of continuity property. According 
to an ancient theorem of Doob’s, any stochastic process has 
a “version” in which all sample functions have this property.’ 
To understand the meaning and use of separability, it is better 
to read my exposition in [M; II.4] for Markov chains because 
there it is truly indispensable and used “at every step”. 
Having proved the optionality of TJ’, for an open set A, 
we will take a closed set A and approximate it by its open 
neighborhoods, as follows. Put Bn = {x : d(z,A) < +} 


where “d” denotes distance; then B, is open, B, D Bry, 
oOo oO why 

(| Bh = (| Bn = A. Using the continuity hypothesis in 
n=1 n=1 
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(4.5), we should have 
limTp, =T4 (4.8) 


where the limit is approached strictly increasingly. It then 
follows that for each t > 0: 


CO 
{Ta <t}=(){Tp, <theF, (4.9) 
n=1 
and we have proved a little more than needed for the option- 
ality of T4. In these matters hairs must be split: it makes a 
difference whether we have “< t” or “< t”, but here (4.9) is 
more than (4.4) (why?). I have written down the basic idea of 
the argument above in order to warn the unsuspecting reader 
of a trap! It is a big mistake or a small mistake according to 
each one’s taste, and I will leave it to you to find it out yourself. 
Here is a hint: let the path start from A to see what can go 
wrong with (4.8). The author of a book on Brownian motion, 
a respectable mathematician of my acquaintance, overlooked 
this pitfall although it was forewarned in my book [L; p. 72]. 
There are more than one way to salvage the proof, either by a 
detour as done in [L; p. 71] or by a more direct method told to 
me by Giorgio Letta when I lectured on it in Pisa, 1989, which 
was recorded in [F, p. 19]. 

Before proceeding further let us explain some notation to 
be used. For any function w —> T(w) we can define the function 
w —+ X(w,T(w)); this is usually denoted by X. Similarly for 
any t € Ro, the function w ~ X(w,T(w) +t) will be denoted 
by X74. But if for some w, T(w) = +00 as allowed in (4.2), 
then since X (w, +00) is undefined we cannot define X (w, T(w)) 
for that w. Occasionally it is feasible to extend the definition 
of X(w,t) to t = +00; but unless this has been done we must 
restrict X7+4 to the set {T < oo}. We must also assume that 
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the function (w,t) > X (w, t) is jointly measurable F x B, where 
B is the classic Borel tribe on Ro, in order to assert that X7++ 
is a random variable, namely F-measurable. 

After these preliminaries we are now in the position of 
extending our previous post-T process from a constant T’ to 
a finite optional time T: 


XP =Xrit, tERo. (4.10) 


Since a constant time is optional (prove it!) this definition 
includes the previous one in Sec. 3. Observe that when T' is 
optional, so is [+t for each t > 0. This is an important remark 
though we may not see its use below. Now we announce our 
next, truly revolutionary hypothesis, to be called “post-option 
homogeneity”. 


For each finite optional time T, the post-T' process {X/'} 
obeys the same probability transition law as the original 
process {X;}. (4.11) 


This means, by referring to (3.7), that for each t > 0, 
X} — Xf has the probability density p(t;0,-). Needless to 
say, the hypothesis (4.5) implies that all sample functions 
t + X(w,T(w)+t) of the post-T process are continuous in Ro. 
Finally, what happens to (3.5) and (3.6)? Pr is not defined 
for a random T, yet. We now define it through Xf = X7, as 
follows: 


Pr(x,A) = P*{X7r € A}; Prf(x) = E*{f(Xr)} . 
(4.12) 


This notation is invented here for the occasion, it is not 
“standard” or “habitual”, but it will be seen to be most 
appropriate below. 
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Under the hypothesis (4.11) the developments in Sec. 3 for a 
constant time T can be carried over to a finite optional time 7, 
without change, beginning with (3.4) and ending with (3.9) and 
(3.10). A backward glance at the previous alternative method 
shown in (3.11) makes it abundantly clear why semigroup is 
now helpless because a random time T’ is simply not in its 
structure. 
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5. MARKOV PROPERTY 


Let D be a bounded domain in R°, D its closure, OD 
its boundary, D° its complement. The boundary is defined 
logically by DND¢. It has already appeared in Green’s formula 
(1.1), but in general it can be a tricky set, not necessarily of 
lower dimension than D, even less to have a surface area, as 
undoubtedly Green took for granted for his conductors. All we 
know is that OD is a compact set. Since D° is a closed set, its 
hitting time Tp- is optional as discussed in Sec. 4. We will call 
it the “first exit time from D” and denote it by Sp: 


Sp =inf{(0<t<o; X,€ D*}. (5.1) 


We will now prove that almost surely Sp < oo. To do so 
we need to strengthen the hypothesis (3.7) by bringing in 
the “past” with respect to the chosen time J, as well as the 
“future” that is already embodied in the post-T process. For 
each constant 7’, the Borel tribe Fr generated by the collection 
of random variables {X;, 0 < t < T} will be called the pre-T 
tribe; the Borel tribe generated by {X;, T < t < oo} will be 
denoted by F, and called the post-T tribe. The “present” Xr 
belongs to both tribes by this definition; so be it. The post-T 
tribe is of course just the tribe generated by the post-T process. 
We could have also defined a pre-T’ process, but this will not 
be necessary yet. The single random variable X7 generates 
a Borel tribe which will be denoted by the same symbol. 
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You should review the notion of conditional probability and 
expectation relative to a Borel tribe. The following hypothesis 
is the formal statement of the “Markov property”. 


Conditioned on X7, the tribes Fr and Fh 
are independent. (5.2) 


This means: for any A; € Fr and Ag € Fir we have 
P{A, NAo|Xr} = P{Ay|X7r}P{A2|Xr} . (5.3) 


In this form it looks neat (symmetric!) but it is not very 
convenient for use. A more handy form is the following: for 
any A € Fr, 

P{A|Fr} = P{A|X7} . (5.4) 


This is often described by words like: “the past has no after- 
effect on the future when we know the present.” But beware of 
such non-technical presentation (sometimes required in fund- 
ing applications because bureaucrats can’t read mathematics). 
Big mistakes have been made through misunderstanding the 
exact meaning of the words “when the present is known”. For 
example let T = 2, and A = {3.14 < |X3| < 3.15}, then (5.4) 
does NOT say that 


P{3.14 < |X3| < 3.15 | 3 <|X4| < 453.1 <|X9| < 3.2} 
= P{3.14 < |X| < 3.15 | 3.1 < |X| < 3.2}; 


namely that the past “3 < |X,| < 4” may well have an after- 
effect on the future when the present |X 4| is given as shown. 
Why? and what does (5.4) say? You must figure this out to 
understand what follows, correctly.’ 

We are going to use the Markov property to prove an 
important result on Sp, as follows: if D is a bounded Borel 
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set then 
sup E*{Sp}<oo. (5.5) 
ze R3 
For each T' = 1,2,...,n,..., the random variables X,, 


1 <n < T, belong to Fr, while X74, — Xr belongs to 
Fir. Hence by an application of (5.4) and using the (abused) 
notation introduced in (3.9), we have 


PIX¢eD telreoe rst 


=P{X,€D forl<n<T; P*T(X,e€ Dj}. 
(5.6) 


Note that the temporal homogeneity of {.X;} is used above as in 
(3.9). Note also that we have used P for E in the right member 
even though there is a random variable P**|...] under it. This 
mild flip in notation will be indulged from time to time. Now 
by (3.7), we have 


p*r [X € D\ = i p(1, Xr, y)dy . (5.7) 
D 
For Xr € D, this has the almost sure upper bound 
sup [rie yjdy=O0<1. (5.8) 
xzED 
D 


To see why @ < 1 it may be easiest to observe that when both 
z and y belong to D then |x — y| < M = the diameter of D, 
so that 
o= / pls, 2)de< 1 
|2|<M 


Using this in (5.7), we obtain by induction on T in (5.6) that 


VtEeD: P*{X,€Dforl<n<T}<6. (5.9) 
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Recall that the x in P® indicates oS = x. Now the set 
{Sp > T} is contained in the set ia {X, € D} [warning: 
Sp > T does not imply Xr € D)); ae 


Vee D: P*{Sp>T} <6". (5.10) 


For any random variable S > 0, its expectation satisfies the 
inequality 


S)< So P(S>n). (5.11) 
n=0 
[This is true even if P(S = 0) > 0, why?] It follows that 


sup E*{Sp} < yo" = = — < 00. (5.12) 
n=0 _ 

How to extend the sup above to x € R?° is left to you as an 
exercise, it is easy but remember D is an arbitrary Borel set.® 
An immediate consequence of (5.5) is that Sp < oo as. 
That is what we need for the immediate future. But let me 
pause to comment on the result in (5.5). For any Borel set 
D, we shall say it is “Green-bounded” when (5.5) holds; thus 
this is a condition on the set D that is broader (weaker) than 
ordinary Euclidean boundedness. Now Sp represents a random 
time (optional or not) and the sup of its expectation is a 
measure of the temporal size of D. Indeed, a little elaboration 
of the argument above yields an inequality valid in R?, d > 1: 


sup E* {Sp} < Agm(D) (5.13) 
xrERe 


where m denotes the Borel-Lebesgue measure and Ag is an 
absolute constant depending only on the dimension d (see [G]). 
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Many results valid for bounded sets can be extended to Green- 
bounded sets. As for the reason why it is called Green, it will 
be explained soon. 

The next major step is to combine the temporal homo- 
geneity hypothesis (3.7) with the Markov property in the form 
(5.3). For any s € Ro, t € Ro, we have for f € By, e.g., 


Psitf (x) = E*{f(Xo4t)} = BLE [f(Xi)]} 
= E*{P,f(X.)} = Ps(P:f)(2) (5.14) 


The second equation above holds as in (3.9) with T = s, and 
the rest is nothing but notation. The symbolism in E**[f(X;,)] 
is really horrid duplicity because if s = t the two X; in 
E**(f(X;)] do not denote the same thing, as already noted 
under (3.9). No matter; the result may be recorded opera- 
tionally as P,,; = P,P; and shows that {P;,t > 0} forms a 
“semigroup” with Po = the identity operator. It is called the 
transition semigroup of the process. It is NOT the process, as 
some semigroupers would like to think. 

We can express the joint distribution of any finite set of 
random variables from the process. It suffices to show this for 
four of them beginning with Xo. Thus let 0 = to < t; < tg < 
tz. Then we have for f; € By, i = 1, 2,3: 


E*{ fi (Xt, ) fo( Xt.) fa(Xts)} 
= Pr, (fiPt.—t, (fo : Prs—tef3))(£) 
= [f[vtrseet(er)olt — t1;21, £2) fo(z2) 
‘ p(ts as t; 22; £3) f3(x3)dxidx2dx3 (5.15) 


where the integrals are all over R°. Now the normal density 
function p(t; z, y) given in (2.1) depends on (z, y) only through 
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x — y (actually only |z — y}). Let us put 
pe(z) = p(t; ,0,z) = p(t; 2,2 + 2) (5.16) 


for any z; then the last member in (5.15) reduces to 


| f [rx (x1 — £0)Pty—t, (@2 — £1)Pts—tz (#3 — 22) 


x fi (21) fo(x2) f3(a3)dr,dredx3 (5.17) 


where J have written xo for x. Noticing that 7, = r@p_, + 
(fn —2n—-1), n = 1, 2,3, this shows that the three random vari- 
ables X;, — Xt,,_,, 2 = 1,2,3, are stochastically independent, 
and have respectively the probability densities p;,,-+,_,. Can 
there be any doubt that the same is true for any number of 
similar differences instead of three? Thus we have reached a 
fundamental property of the BMP, as follows. 


For any positive integer N and 0 = tp < ti < to < 

... < ty, the collection of N + 1 random variables Xo 

and X;, — Xt,_, for n = 1,2,...,.N, are stochastically 

independent with probability densities p(t, — tn_1;0,-) 

given in Eqs. (5.16) and (2.1). (5.18) 
In Einstein’s investigation, he wrote [p. 13]: 


We will introduce a time-interval 7 in our discussion, which 
is to be very small compared with the observed interval 
of time, but, nevertheless, of such a magnitude that the 
movements executed by a particle in two consecutive inter- 
vals of time 7 are to be considered as mutually independent 
phenomena. 


The editor R. Fiirth appended a note to the above [p. 97]: 


The introduction of this time-interval 7 forms a weak point 
in Einstein’s argument, since it is not previously established 
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that such a time-interval can be assumed at all. For it might 
well be the case that, in the observed interval of time, there 
was a definite dependence of the motion of the particle on 
the initial state. 


I have quoted the two passages above in order to show a 
sample of physicists’ way of using mathematical language. If 
we take Einstein’s words literally, the following conclusion will 
follow. Let t) < te < tz, and tg — t) < T, t3 —tg <7, then 
clearly the two intervals (t1,t2) and (t2,t3) can be put into 
two contiguous intervals both of length < 7, and so the move- 
ments in (¢1,t2) and in (t2,t3) are mutually independent. It 
follows by induction that when any interval (0, t) is divided into 
sufficiently small parts (of length less than 7), then the move- 
ments in all pairs of consecutive subintervals are independent. 
In Einstein’s statement he considered only “two consecutive 
intervals” and said nothing about three or more such intervals. 
Nowadays any student of elementary probability knows [or does 
he?] that three events can be pairwise independent without 
being “totally” independent in the current usage of the word, 
see, e.g. [E; p. 142]. Apparently it was Serge Bernstein, teacher 
of Kolmogorov and Khintchine* et al. who first gave such a 
“counterexample”. But Einstein probably did not care about 
such finesse and we must give him the benefit of doubt that 
by his mutual independence between consecutive intervals he 
really meant total independence among all the successive ones. 
If so, his hypothesis would be tantamount to (5.18) above. But 
then a caveat arises: there is no need of the 7 to begin with. 


* Apparently not in the literal sense, as A. Yu Veretennikov informed me, 
who also told of a case of inadequate domination forewarned on p. 9 above 
(March 2001). 
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The upshot seems to be: Einstein really had some more 
complicated model in mind which he did not make precise, of 
which the Brownian motion as formulated in (5.18) served as a 
first approximation.!° Actually he wrote his first paper on the 
subject before he had heard about Brown, and his investigation 
“proved” the physical existence of molecules. 

Let me also supply the hindsight regarding (2.9), a crucial 
step in Einstein’s argument. This amounts to the fact that the 
(statistical) variance of X(t) is equal to ¢, for all t > 0, or in 
the more vivid notation of stochastic calculus: 


dX(t) =v. 


Observe that the ¢ above is arbitrary, no 7 there. 


6. BROWNIAN CONSTRUCT 


We are ready to apply (4.11) with JT = Sp, where D is 
a Green-bounded domain. Since Sp is a finite optional time, 
Eq. (3.10) holds with T = Sp, namely: 


Sp 
‘| fX)at$+Ps,Gf(z). (6.1) 
0 


This is the key to the Brownian resolution of Green’s problem 
discussed in Sec. 1. The first term on the right-hand side above 
is an operator Gp defined as follows: 


Gpf(e e{/ I(Xs)d (6.2) 


for suitable f, as always. It will be called the Green potential?! 
operator for D. The expectation in (6.2) is defined for any 
domain D for a positive Borel function f, since the integral 
in (3.2) is then a well-defined F-measurable function, though 
not necessarily finite and so by certain conventions not always 
qualified as a random variable. When D = R?, then Sp = +00 
and Gp reduces to G. Thus the “ground potential” is just an 
extreme case of the Green potential. 

Next, let us examine the second term on the right-hand side 
of (6.1). By (4.12) we have 
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Ps, (2,4) = P*{X(Sp) € A} . (6.3) 


For each x € D, Ps,(z,-) is a probability measure, ostensibly 
on the Borel tribe B* (for R*). But since the path t > X(t) is 
continuous by (4.5), at the first instant it leaves D it must be 
at the boundary, namely 


vee D: P*{X(Sp)€dD}=1. (6.4) 


On the other hand, for each x ¢ D, we have P*{Sp =0}=1 
and so P*{X(Sp) = X(0) = x} =1. It seems dumb to bother 
with this banality, but it serves as a good reminder of the full 
force of the definitions. The last-written equation need not be 
true for r € OD = D\D, owing to our choice of “0 < t < co” 
instead of “0 < t < oo” in (5.1). This is a most serious matter 
which will turn up soon. 

To repeat, when z € D and Sp < o as we are assuming, 
the probability measure defined in (6.3) is supported by the 
boundary 0D. This measure is known outside probability 
theory as the “harmonic measure for D”, arrived at usually 
through operatoric (functional-analytic) considerations. Let us 
re-denote it by Hp(z,-), restrict x to D, and adopt a trendy 
notation as follows: 


Hp(x,dz) = P*{X(Sp) € dz}, (x,z)€ Dx (OD). (6.5) 


Now take any bounded Borel function f defined on OD, written 
as f € By (OD); then we have 


Hpf(x) = E*{f(X(Sp))}, ved. (6.6) 


The function Hpf is bounded because f is; is it measur- 
able? For some psychological reason this kind of measurability 
question is regarded by many as a bore but it is inescapable, 
unfortunately. Indeed in probability theory we need a little 
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more: the customary Lebesgue measurability is not serviceable, 
Borel measurability is necessary (why?). I will state a general 
result sufficient for our purposes, and indicate the steps of its 
proof, but leave the chores to you to carry out. After doing 
such exercises a few times, you may then wave hands next time 
around, may be. 

First of all, we must sharpen our previous assertion that the 
function (w,t) -> X(w,t) is (F x B)-measurable to its being 
(Foo x B)-measurable, where F,, is the tribe generated by all 
F,, 0 <t < oo, namely by the process {X;}. Then ifw > T(w) 
is F..-measurable, w + X(w,T(w)) is also F..-measurable. 
Any optional time T is ¥,,-measurable, hence so is Xr. A set 
in ¥,. of the form, where N is a positive integer, t, € Ro, 
Ay € B?: 


N 
() {Xt € An} (6.7) 


n=1 

is called a “cylinder set”. For any such set A, the function 
x — P*(A) is Borel measurable. For N = 3, this is seen by 
a direct inspection of (5.15), where we take fn = 14,; and 
the general case presents no problem. Now the cylinder sets 
generate the tribe F,., and Borel measurability is transmissible 
from them to the whole tribe. This is done by one of the 
“monotone class theorems” discussed in Sec. 2.1 of my Course 
(C]; the most convenient form to employ in this case, I think, 
is that given as Exercise 10 on p. 20 there, due to Sierpinski.}? 
Thus the final result is as follows: 

For any A € Fo, the function « > P*(A) 

is Borel measurable (in R*). (6.8) 
As two consequences of (6.8): for any Borel measurable f, 
Gpf defined in (6.2) and Hpf defined in (6.6) are both Borel 
measurable. 
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Much ado about rather little! For immediate compensation 
we state the major fruition: 


Hpf is harmonic in D. (6.9) 


Fix an arbitrary x in D; the following statements are true P*- 
a.s. Take any ball B = B(z,r) such that B C D. We know 
that the path starting at x will exit D for the first time at the 
time Sp, but before doing so it must exit from B, i.e., Sp < 
Sp, “stopping” momentarily at time Sg and place X(Spg). 
By (4.10), the post-Sg process {X;} behaves probabilistically 
exactly like the original process, but with the new starting 
place X(Sg). Its path too will exit from D at its hitting time 
of D° which is properly denoted by Si. We have by temporal 
homogeneity, for any y: 


EX f(Sp)} = E*{f(Sp)} = Hof ly) , (6.10) 


where in the first term the y in EY indicates Xj = y, whereas 
in the second term it indicates Xo = y. (We could have used 
E" for the former.) Thus, by tracking the path from x via the 
“stop” at X(Sg) until time Sp, we obtain 


Hp f(z) = E*{f(X(Sp))} = / P*{X(Sp) € dy} BY{ f(S')} 
zi y P*{X(Sp) € dy}Hpf(u) 


= / Ha(z,dy)Hpf(y) =He(Hpf\z), (6.11) 


where in the last two equations we have used (6.5) with B for 
D. We know from our general discussion above that Hp(z,-) 
is supported by the 2-dimensional sphere 0B = {y € R? : 
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|y — z| = r}, but we can say a lot more. It is the uniform 
distribution on OB: 
o(dy) 

Hp(a, dy) = , 
This is intuitively obvious from the spherical symmetry of the 
normal density (2.1), but a logical proof is more tedious than 
you might think because we are dealing with the random vari- 
able w + X(w,Sp(w)). Using (6.12) and (6.11), we arrive at 
the desired relation 


Hpf(e) = OB) } Hpf(yo(dy), (6.13) 
OB 


(6.12) 


for any x € D, and any ball B with center x and with 
closure contained in D. Such a function Hpf is said to have 
the “sphere-averaging” property in D. If we now use polar 
coordinates and Fubini—Tonelli’s theorem, we can integrate 
(6.13) along the radius to derive the “ball-averaging” property: 


Hpf(x) = =i} |) Hpf(y)m(dy). (6.14) 
B 


Note that we are using Lebesgue integration in two and three 
dimensions, respectively, in (6.13) and (6.14), and his big theo- 
rem that a bounded measurable function is always integrable. !* 
If we did not know that Hpf is (Borel) measurable, we could 
not even write down (6.13). Now it follows from (6.14) that 
Hpf is a continuous function in D. This is a nice exercise, 
not hard, that you should try to do yourself; actually with 
a little trick one can even prove that any function having 
the sphere-averaging property, and integrable over balls, is 
infinitely differentiable in D, written as C°(D), but we do 
not need this here. 
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What have these properties to do with harmonicity? It 
was Gauss who established the sphere averaging property for a 
harmonic function; but it was later that Koebe (also Bécher for 
R?) showed the converse, more precisely: a continuous function 
which has the sphere-averaging property is harmonic (all in 
D).* The ancients did not know “measurable”, but who needs 
it who had “continuous”? It is Koebe’s part of this remarkable 
characterization of harmonicity that we need here to conclude 
that our Brownian construct Hpf is harmonic in D for any 
f € B(O0D). As a matter of fact, one can even omit the 
boundedness of f, provided that the mathematical expecta- 
tion in (6.6) be well-defined (what does this mean?) and that 
it be finite at some point (one is enough!) «x in D. For this 
impressive generalization see [L; p. 154ff.]. 


* * . . : . 
Koebe’s theorem is an exercise in calculus using Gauss’s divergence for- 
mula, a good companion to Green’s formulas. 


7. THE TROUBLE 
WITH BOUNDARY 


We now consider the boundary values of Hp f. Suppose we 
were to change the definition of Sp given in (5.1) one hair by 
replacing the first “<” in inf {0 <t<oo:...} with “<”; then 
for any z € OD we should have trivially 


P*{Sp =0}=1, (7.1) 


and therefore Hp f(z) = E*{f(X(0))} = f(z). In other words, 
Hpf would have the given boundary value f on OD. In 
particular, if f is continuous, ie. f € C(OD), then since 
Hpf € C(D) and Hpf = f on OD, it would be reasonable 
to expect that Hp f € C(D), namely that it is “continuous up 
to the boundary”. Astonishingly, this is false in general. 

The point of the “digression” (is it?) above is that even if 
we tried to cheat a little to “make life easier”, the real problem 
would not go away. So we will return to the original definition 
of Sp and face the difficulty. Given D and a point z on its 
boundary OD, we shall say that D is “regular” at z, or that z 
is a regular boundary point of D,' in case 


E*{Sp} =0 (7.2) 


which is equivalent to (7.1). We shall say that D is regular if D 
is regular at every z € OD. A simple example to the contrary 
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is the punched ball D = B(0,1)\{0}, where 0 is an isolated 
boundary point. To see that D is not regular at 0, we need an 
important property of the BMP, valid in R¢, d > 2: 


Ves. PPT S00} 20 (7.3) 


Recalling the definition in (4.2), this means the point {0} is 
almost surely never hit. Owing to the spatial homogeneity of 
the process, the same is then true for any other point. We 
shall need this result again and it will be proved in Sec. 10. 
Accepting this for the moment, it is obvious that any path 
starting from 0 cannot hit 0D = OB(0,1) U {0} right away, 
hence Sp > 0 a.s. This is much stronger than the negation of 
(7.2). It turns out that if D is not regular at z € OD, then we 
must have 

P?{Sp >0}=1. (7.4) 


This follows from a zero-or-one law due to R. M. Blumenthal, 
a doctorate student of G. A. Hunt’s. 

We are ready for the Boundary Convergence Theorem for 
a Green-bounded domain D, and f € By,(OD). If D is regular 
at z € OD, and f is continuous at z, then we have 


lim E*{|f(X(Sp)) — f(X(0))]} = 0 (7.5) 


where x varies in D. This is a form of the solution of Dirichlet’s 

boundary value problem, apparently unnoted outside prob- 

ability, which is actually equivalent to (7.16) below (why?). 
The first step is to prove 


lim E*{Sp} = E*{Sp}=0, (7.6) 
where now xz may vary in all R* (why the difference between 


(7.5) and (7.6)?). Define an approximation of Sp, for each 
E> 0: ; 
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Sp =inf{e<t<oo: X:€ D*}. C70) 
Then as ¢ | 0, S$ | Sp and 
E*{Sp} = lim | E*{Sh} . (7.8) 


This requires a proof! Applying the post-e temporal homo- 
geneity, using (6.1) and observing that E*{Sp} = Gp1(z) by 
(6.2), we obtain 


E*{Sh} = E*{B*[Sp]} = P.(Gpl)(a). (7.9) 


Since Gpl is bounded by (5.5), this shows the function in 
(7.9) is also bounded. Its finiteness at x for some ¢ is suffi- 
cient for (7.8), by dominated (sic; not bounded or monotone) 
convergence. 

Now is the occasion to announce another important 
property, this time not of the Brownian process, but only its 
transition probability, as follows. 


For each f € B,(R*) anyt >0: Pf © Cy(R*). (7.10) 


This is known as the “strong Feller property” — a misnomer 
because it is not stronger than another property called the 
“Feller property”. The latter is fundamental for the construc- 
tion of a class of Markov processes called “Feller process”, 
precursor of Hunt process; see [L; Sec. 2.2]. The BMP is a 
special case of Feller process. It turns out that the conjunc- 
tion of the two Feller properties, which I call “Doubly-Feller” , 
possesses remarkable self-preserving qualities [Chung D]. Since 
(7.10) is a purely analytic result, its easy proof is left as an 
exercise. 

Returning to (7.9), we see that for each e > 0, E*{S5} is 
continuous in x. Hence for any z, we have 


Tim E*{Sp} < lim E*{S§}=E*{S}, (7.11) 
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and then by (7.8) : 
lim E*{Sp} < lim E*{Sp} = E*{Sp}. (7.12) 
Lz E 


Since z is arbitrary, this means that as a function of z, E*{Sp} 
is upper semi-continuous in R¢. In general, it is not continuous. 
But under the condition (7.2), it is so at z, and that is what 
we need. 

Heuristically, the result (7.6) should mean “in some sense” 
that Sp goes to 0. Suppose f is continuous on all OD, then 
since X(-) is continuous, it should follow that f(X(Sp)) goes 
to f(X(0)) “likewise”. At least one book-author seemed to 
think so and left it at that.t° “What’s the problem?”. 

One problem has been forewarned in Sec. 4 in the defini- 
tion of X(Sp); it is w + X(w,Sp(w)), a random function 
of a random function. It does not behave as nicely as say 
y(Sp(w)) where y is a non-random function. So the con- 
tinuity of t + X(w,t) must be used with circumspection. 
Another more obvious problem is that we are dealing with 
the convergence of E*{...} as x varies, a kind of “weak 
convergence” but not quite. As far as I can see, there is no 
ready-made convergence-mode to enable us to draw (7.5), or 
(7.16) below, from (7.6). We have to go back to basics and put 
down a few epsilons and deltas. 

Define a random (optional) time for each r > 0. 


Zr =inf{0<t<oco: |X;—Xo|>r}. (7.13) 


This is the first time that the path (or Brown-—Einstein’s 
particle) moves a distance r. By spatial homogeneity, we have 
for all t > 0: 


PAZ St Pet eh, (7.14) 
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The exact distribution of Z, is known but it is complicated and 
scarcely serviceable. Fortunately we need only the “mini info” 
that Z, > 0 (a.s., why?). This is equivalent to 


. 10) < a 
lim P°{Z, < t} =0 (7.15) 


(why?). Now let ¢ > 0 be given; there exist three positive 
numbers r, 7, 6 all depending on ¢ and each depends on the 
preceding one; such that 


(i) sup |f(y)—f(z)| <e; 
Woe 
(ii) sup P°{Z,<th<e; 
x—2z\|<r 
(iii) sup sup P*{Sp>T}h<e. 
ju—z|<r |e—z|<6 


Continuity of f at z yields the r in (i); then (7.15) yields the 
T; then (7.6) plus Chebyshev’s inequality yields (iii). 
Hence if |2 — z| < 6, then by (ii), (iii) and (7.14): 


P°{Sp > Z,} = P* {Sp > 2,32, ST} + P° {Sp > Z,,Z, > T} 
< P°{Z, <7} 4+ P {Sp >} <et+e=2e; 


consequently 
E*{|f(X(Sp)) — f(X(0))|;._ Sp 2 4,} S2Me, 
where M = sup|f|. If |x — 2| <r, then we have by (i): 
E*{|f(X(Sp))- F(X); Sp< 2} <e 


because on the set {Sp < Z,}, |X(Sp) — X(0)| <r. 
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Combining the two cases, we conclude that, if |z—z| < dAr, 
then the sum of the two expectations is < (2M +1)e. This 
establishes the result (7.5). Is the argument too long? See 
Addenda I below. 

A trivial Corollary is the usual form of the boundary con- 
vergence: 


lim E*{f(X(Sp))} = lim E°{(X(0))} 


= lim f(z) = f(z) . 


Lz 


(7.16) 


We have solved the classical Dirichlet boundary value prob- 
lem. Let us denote it by (D,f), where D is a bounded and 
regular domain, and f € C(OD). The problem is to find a 
function h such that h € C(D), h be harmonic in D and h = f 
on OD. A solution is given by the Hpf defined in (6.6). It 
is equal to f on OD. It is harmonic in D by (6.9); and as a 
special case of (7.16), it converges to f at every point of OD, 
hence it is continuous in D. 

In fact, Hpf is the unique solution of (D, f). This is equiv- 
alent to the assertion that the identically zero function on D is 
the unique solution of the Dirichlet problem (D,0) where 0 is 
the identically zero function on 0D. The proof follows from the 
ball-averaging property of a harmonic function given in Sec. 6, 
and is commonly referred to as the “maximum principle”. Here 
is a large hint: how can the average of a ball of values be equal 
to its maximum or minimum? You should carry out the nice 
little argument yourself, purely analytic. 

Dirichlet’s problem may not be solvable without the condi- 
tion of regularity (missed by Dirichlet as well as Gauss). The 
previous example of the punched unit ball will serve. Define 
f =1 on OB(0,1) and f(0) = 0; then f is continuous! Both 
the functions h, and hz, where hi(x) = 1, he(x) = ae are 
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harmonic in D = B(0,1)\{0}, and have the correct boundary 
values on 0B(0, 1), but miss being a solution at 0. A little trick 
(see Sec. 10 below) shows that there is no solution. Mathemati- 
cians at an earlier time must have disdained this example as 
not being a regular one (pas normal), but they had to accept 
a later construction by Lebesgue, known as a thorn or spine, 
as irrefutable. By the way, it was not so easy to define a 
regular boundary point as we did in (7.2), without Brownian 
theory, but it was done, by “barriers”. The probabilistic 
solution was given first in 1944 by Shizuo Kakutani. He did 
it in R? and considered a Jordan domain (which is always 
regular); and instead of a continuous f he considered an arc on 
its boundary, so that he could construct the harmonic measure 
as given in (6.5), “in the sense of Nevanlinna”. In his article 
he chose to omit some parts of the proof as being too delicate 
to be explained. As stated in Sec. 4, Wiener [D] proved the 
legitimacy of (4.5); a year later he [T] solved the generalized 
Dirichlet problem (which is essentially the assertion (7.16) for 
regular points, without regard to irregular points). But he did 
not employ the BMP in his work on the Dirichlet problem. 
There was hardly time and his notation did not provide room 
for a random time. He did give a necessary and sufficient 
condition for regularity, using Newtonian capacity which will 
be alluded to in Sec. 11. It is odd that in the later book 
by Paley and Wiener (1934), another proof of (4.5) was given 
by Fourier transform methods, but no mention was made of 
Wiener’s 1923 paper [D]. 
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8. RETURN TO GREEN 


As reviewed at the beginning, Green was intent on the 
representation of a harmonic function by its boundary values, 
and used a heaven-made solution of a particular boundary 
value problem to obtain his formula. From the Brownian point 
of view the representation problem is easy! Let h be harmonic 
in a bounded and regular domain D. Surely Green would not 
mind it if we suppose that h is continuous up to the boundary, 
namely h € C(D). Then pronto! 


EDs ABS : P®{X(Sp)€dz}h(z) (8.1) 
oD 


is the representation sought. Why, it is just h = Hph because 
the latter is the unique solution of the Dirichlet problem (D,h), 
whereas the given h is also a solution. 

Observe that when D is a ball and z its center, (8.1) reduces 
to Gauss’s spherical averaging. Thus Green’s representation is 
a weighted averaging for a general D, given by the probability 
distribution of X(Sp) — a big generalization of Gauss’s 
theorem. 

As a matter of fact, for a ball in any dimension, an explicit 
analytic formula is known as Poisson’s. Let B = B(0,r) in R%, 
d > 1, and put 
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Then we have 
P*{X (Sp) € dz} = Hp(z,dz) = Kp(z,z)o(dz). (8.3) 


When d = 2, and x = (p,0) in polar coordinates, (8.1) 
becomes 


1 r2 — p? 
M08) = 3— | apo soy re Mn? 
0 
2 


= ft; + 3 (2) cosn(o - a] h(r, p)dy . 
(8.4) 


This can be derived from Cauchy’s Integral Formula in complex 
function theory, which is a representation of a holomorphic f 
by its boundary values: 


fae [ha 


Ori Lr 2 
C 


] 


see Titchmarsh [pp. 124-5]. I don’t know if Green knew 
Cauchy’s theorem. He did not mention it in his Essay. 

When d = 1, D = (-1,+1), OD is the two-point set 
{—1,+1}. It is remarkable that (8.2) still works and yields 


Rei) 5 ao. Ce Sa s(t ty “eee 
(8.5) 
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with o({—1}) = o({+1}) = 1. To be explicit, we have 
P*{X(Sp) = #1} = 5(1F2) ; 
so that (8.1) becomes 
h(x) = 5(1—a)a(-1) + 5(1+2)a(+1) 


h(+1) — h(—1) h(+1) + A(-1) 
= c+ 5 
2 2 

This is the equation of the straight line passing through the 
two points (—1,h(—1)) and (+1,h(+1)). Euclid says there is 
exactly one such line. Since a harmonic function is a linear 
function in R1, and vice versa, the calculations above amount 
to a probabilistic solution of the geometrical problem of repre- 
senting the line, via Descartes’ cartesian method. Quite a de- 
nouément. Since we did not prove (8.2) here, let me say that in 
R!, the requisite Poisson kernel Kg, namely the evaluation in 
(8.5), can be done by probability argument (how?). Apropos, 
there is a challenging open problem regarding (8.2) for d = 2 
and d = 3, or more specifically (8.4). As already said before, 
and clearly indicated in (8.3), the determination of Kg(z, z) 
is that of calculating a nice distribution that has a simple 
geometric meaning. It ought to be solvable by direct methods 
of “geometrical probability” (formerly a well-known speciality, 
reported e.g. in the Encyclopedia Britannica), without the 
seemingly involuntary involvement with harmonicity et cetera. 
But apparently there is no easy way. 

The proof indicated above for Green’s representation (8.1) 
uses the full force of the unique solvability of the Dirichlet prob- 
lem (D,h), hence it requires the regularity of D. Curiously, 
the latter hypothesis is dispensable so far as representation 
is concerned. Here is the argument well worthwhile for its 


-l<ax<4+l. 
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general probabilistic flavor. Although D itself may not be 

regular, it can be approximated by regular subdomains. In 

fact standard geometrical consideration yields a sequence of 

“smooth” subdomains D,, such that Dn C Dn41 for alln > 1 
CO 

and that J D, = D, namely D, tt D. The qualifier 


n=1 
“smooth” here means that the surfaces OD,, have tangent 


planes et cetera. Such a domain can be shown to be regular. 
In fact, there is a more general criterion known as Poincaré— 
Zaremba’s “cone condition”, which has recently been flattened 
by a Brownian argument and thereby made considerably easier 
to apply, see [L; p. 165]. Anyway, we can apply (8.1) to each 
D,, to obtain a (the) representation of h in D,, which we will 
write in probabilistic form: 


VaE Dn: h(x) = E*{h(X(Sp,))}. (8.6) 


As n + oo, Sp, + Sp (proof?), X(Sp,) + X(Sp) and 
h(X(Sp,,)) > h(X(Sp)), by continuity of paths and continuity 
of h up to the boundary. Since h is bounded in D, Lebesgue’s 
bounded convergence theorem allows us to pass to the limit 
under E” to obtain 


VaeED: h(x) = E*{h(X(Sp))} (8.7) 


which is (8.1). It seems odd that the regularity of D is not 
needed in this result. There is a hidden deep reason behind 
it. According to a general result of Kellogg—Evans—Doob-— 
Hunt [L; p. 223], although there may be irregular points on 
OD, almost no path will ever hit them. Thus they are not 
really there so far as the paths are concerned. Earlier we 
mentioned that a singleton is never hit, which is a special case 
of this. A set so small that almost no path can hit it is called 
a “polar set”; it is also identifiable as a set of “Newtonian 
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capacity” zero. Now that we have dropped this name, let us 
pause a moment to reflect on some consequences. Since each 
singleton is polar, and it is trivial that a countable union of 
polar sets is still polar, it follows that the set of all points in 
R° whose three coordinates are rational numbers is a polar 
set. They are everywhere dense, as you well know. Conse- 
quently almost no Brownian path will ever encounter any of 
them from time zero to eternity. Don’t forget that all paths 
are totally continuous. How is it possible for them to twist and 
turn around the everywhere dense set without ever hitting it? 

The above is a digression. Now let us return again to 
Green’s own doing. Using the notation in Sec. 1, Green’s spe- 
cial boundary value problem is the Dirichlet problem (D, gy), 
where y € D and g,(x) = g(x,y) = gga as in (2.3), 
and the solution is hy, which is Hpg, in our new notation. 
The notation is confusing here owing to the duplicity of the 
functions involved which are all symmetric in (x,y). Let us 
write it out more explicitly: 


i 


ee 8.8 
2n\z — y| ea 


1 
5, Mey) = | Ho(ea2) 
aD 


We now define the Green (or Green’s) function for D as follows: 
1 1 


ee DxD. (8. 
a ar 5 Ua,y), (ty) € Dx (8.9) 


9D (x ’ y) = 
This was already spoken of in (1.10), but here the constant 
has been fixed. Be warned that other constants are adopted 
by other folks. You should check that the Green function is 
the density kernel of the Green operator in (6.2), namely: 


CsldG) SGs@ are i go(z,y)dy (8.10) 
A 
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for every Borel set A. For each z, the measure Gp(z,-) is 
supported by D, as seen from (6.2). 

Now let y € (D)°, then u,(-) € C(D) and uy is harmonic 
in D. Therefore by the representation theorem just proved, we 
have uy = Hpuy in D, namely 


iL 1 : 
=a = | Holedy zED, ye(D)*. (8.11) 
oD 


This relation has an electric interpretation as follows. If a + 
charge is placed at x inside the metal conductor surface 0D, it 
induces + charges on the outer side of 0D and — charges on the 
inner side of 0D which are equal everywhere on OD, and their 
distributions are given by Hp(x,dz). Since they cancel out 
nothing happens owing to their presence. When the conductor 
is grounded, however, all the + charges on the outside of 0D 
flow away to the ground, and only the — charges on the inner 
side of 0D remain. Now the original + charge at x, combined 
with the induced — charges distributed as H(z,dz) on OD, 
produces a potential at each point y inside D, which is strictly 
positive and equal to gp(z, y) as shown in (8.9) — never mind 
the true physical constant instead of the = there. On the 
other hand, at each point y outside of D the two cancel out 
exactly and the phenomenon, easily observable experimentally, 
is mathematically shown in (8.11). 

We are going to give a better derivation of the result 
(8.11) that is more intuitive and goes back to the fountain 
head. [What’s intuition to one may be frustration to another! | 
Take a small ball B(y,r) lying outside D, and substitute its 
indicator 1g for the f in (6.1). Reverting momentarily to 
the old notation Ps, to remind us of the meaning of things, 
we have 
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Sp 


1 

fer [relxoae 
lx —y| 

B (0) 


1 
+ f Pooled) fay 
B 


aD (8.12) 


For all 0 < t < Sp, we have X1 € D , hence X; ¢ B and 
1p(Xt) = 0. Thus the middle term above is zero and (8.12) 
reduces to 


1 i / 1 
— dy = | Ps,(2z,dz ——dy . 8.13 
Eee : soledz) | a oy 
B aD B 


Now both members of (8.11), for each x € D, are continuous 
functions of y in (D)°. Therefore if we shrink B to its center 
y in (8.13), we obtain the desired (8.11). 

In this new demonstration no “harmonic” was mentioned, 
and so it is independent of any results about harmonic func- 
tions. Only the post-Sp temporal homogeneity of the BMP 
and the continuity of its paths play the essential role. Hence 
the argument can be easily generalized to other processes and 
other potentials, such as Marcel Riesz’s. 

An explicit analytic formula for Green’s function is rare. 
For a ball, however, a simple geometric trick due to Kelvin 
(Thomson), known as “inverse in a sphere”, does it. For any 
y € R3, its inverse in the sphere 0B(0,r) is defined to be 


r 


y* = —sy. 8.14 

ly|? ee 
Thus y* is the point on the line from 0 to y such that |y||y*| = 
r?. The Green function for B = B(0,r) is then given by 


gp(2,y) = x ( : Woo) (8.15) 


2a \ja—yl lyl jz —y"| 
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To prove this it is sufficient, in view of (8.9), to show that the 
function A(x, y) defined for each y € B by 


: 
h = ——————_ B 8.16 
=a? 8S (8.16) 


is the solution of the Dirichlet problem (B, g(-, y)). Since y* € 
(B)°, 9(-,y*) is harmonic in B by (1.4), and g(-,y*) € C(B). 
Therefore by (8.16) the function h(-,y) has also these two 


properties. Now comes the geometry illustrated in the figure. 


The two triangles (0,z,y*) and (0,y,2*) are similar, because 
Ox - 0z* = Oy - Oy*, hence 


lo} fe—o"| _ yt 
at yar] el oe) 


Consequently we have the remarkable symmetry below: 


gp(z,y) = gay, 2) . (8.18) 


As a particular case of (8.17) when z = z € OB(0,r), so that 
z = 2* and |z| =r, we have 

lal _ le=y"| 
lvl yal” 
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which yields h(z,y) = g(z,y). Thus h(-,y) has the assigned 
boundary values and so it is the aforesaid solution. This estab- 
lishes the formula (8.15) — neat! 

Following Green’s foot-steps sketched in Sec. 1 but un- 
scrambling the notation in (1.9), (1.10), (8.3) and (8.9), we 
arrive at Green’s own representing kernel 


1 0 
5992 lye xeéB, yEeB, z€0B. (8.19) 


Comparing this with (8.2) with d = 3, we conclude that the 
quantity above must be equal to 
r? — |x|? 
feaae : (8.20) 
To carry out the differentiation should be a soul-satisfying 
experience which is left to you. [I only checked the easy case 
for x = 0 to make sure that the negative sign in (8.19) in OK.] 
My re-telling of the Green story must end here, but a few 
more words are in order for the amazing (8.18). Incredibly, this 
symmetry of Green’s function persists for a general domain in 
any dimension. Did physicists such as the redoubtable Faraday 
discover this phenomenon experimentally, or did they possess 
the electrical perception to see that it must be so?!” From our 
Brownian point of view, it is equivalent to the relation below: 


Bey} =P iescap TED VED. 
(8.21) 


There does not seem to be any rhyme or reason for the paths 
to behave so reciprocally. 

But we can derive the symmetry of Green’s function from 
Green’s identity (1.1)! Recall (8.9) but rewrite it as 
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i) 
G = ———__— 
where h,(-) is harmonic in D. Recall also the calculations 
leading from (1.5) to (1.6) which can be recorded symbolically, 


or in the distributional sense as 


hy (x) (8.22) 


1 1 
A— =-—41d9 or A——— =-—4nby, ; 8.23 
al =a be wey 


see Note. 2. It follows from (8.22) and (8.23) that 
AGy = —26y — 0 = —26, . (8.24) 


Since x and y are arbitrary points in D, we can of course 
interchange their roles in the above. Thus we have by (8.24) 


i CRG Gia). (8.25) 
D 


Interchanging x and y we have 


/ GyAGzdm = —2G,(2) . (8.26) 
D : 


Now apply (1.1) with u = G,, v = Gy; then both wu and v 
vanish on the boundary because that is how Green constructed 
them. Therefore the right-hand member of (1.1) is zero, and 
the identity yields by (8.25) and (8.26): 


~2G,(y) + 2G, (a) = ii (G,AGy — GyAG,)dm =0. (8.27) 
D 


Thus we obtain Gz(y) = G,(x), the symmetry of gp(z, y) as 
claimed. 

While the argument above lacks rigor in that it ignores the 
necessity of a smooth boundary and the existence of the normal 
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derivatives in Green’s identity, it should serve in electricity and 
magnetism to let us see the forest together with some of the 
leaves. Green could have done it. I wonder why he had to cite 
Professor Maxwell (see Note 17)? 
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9. STRONG MARKOV PROPERTY 


The Markov property discussed in Sec. 5 is for a constant 
time T; it will now be extended to an optional time. For this 
purpose we need the pre-T’ tribe as well as the post-T tribe 
which has already been defined. How shall we describe an 
event which precedes the random time T? If 7 takes only 
rational values it is natural to consider an event of the form 


Lir=rj}na, 


reQ 
where A, € F,. But if we wish to interpret the prefix “pre-” in 
the strict, sense, then we should require A, € F,. = Vo Fs. 
O<s<r 


Next, for a general T an approximation seems in order, but 
should we approximate it from the left or the right? It turns 
out in fact that there are a number of variations worth con- 
sideration. If you are curious you may consult |L, Sec. 1.3], or 
a fuller treatment in [O]. There are a lot of hair-splitting in 
the field! 

We shall opt for a quick definition as follows. The pre-T 
tribe is the collection of subsets of F such that for each ¢ > 0, 
we have 

ANAT Si} e Fp. (9.1) 


We may even put t = oo in (9.1). Owing to the definition of 
optionality given in Sec. 4. the collection is a Borel tribe and 
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is contained in F,,. When T is a constant, it turns out to be 


Fr+= | fy. For this reason it will be denoted by Fr4 in 
T<t<oo 
general. You should verify the preceding assertions; and also 


that both T and Xr are Fr,-measurable.!® The last result 
requires the (right) continuity of X;, in t. 

Now we can state the extension of the Markov property in 
the form of (5.4). 

For any finite optional time T, and A € Fr: 


P{A| Fri} = P{A| Xr}. (9.2) 


Combined with the post-option hypothesis in (4.11), this is 
called the Strong Markov Property.1® 

Up to here its full force has not been used. The main object 
of this section is to complete the picture and to give a simple 
but telling illustration. 

We begin with the relation 


Sp SB SD 
a4 p(X¢e)dt i p(Xz)dt f o(Xx)dt 
e° =e eos (9.3) 


where Sp and Sz are the same times as in (6.11), and y is a 
suitable function. It should be clear that the first exponential 
factor on the right-hand side of (9.3) belongs to the pre-Sg 
tribe, the second to the post-Szg tribe. Applying (9.2) in the 
“function” (instead of “set”) form, we have 
Sp Sp 
f e(Xe)de J e(Xe)at 
Eé es |\Fsnt ? =E\ |X(SpB)? (9.4) 


Applying the post-Sg homogeneity, we see that the right-hand 
member of (9.4) is equal to 
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Sp 
J e(X)at 
EX(S8) 2 0 (9.5) 
Sp Sp 
If you do not understand why the f there has become J 
Sp 


here, you should re-read the explanations leading to (6. 11), 
The situation has become a little more complicated but the 
idea is the same. Let us now take the indicated conditional 
expectations of both members of (9.3): 


SD 
f e(Xe)dt 
E® { e0 | Font 

SB SD 
f e(%.)at J o(Xz)at 

=e? E* ees | Pgs: 
Sp Sp 
f e(X)at f e(Xt)de 

=e? EX(S8) & @0 ; (9.6) 


y (9.4) and (9.5). Put (with apology tothe previously used wu) 


Sp 
; p(X )dt 
u(z) = B* < ee F (9.7) 


Taking unconditional expectation in (9.6), we obtain 
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f o(xaat 
u(2)=E?< €9 u(X(Sp)) > . (9.8) 


When y is a negative number —A, » > 0, we have 
Ula) = tere (9.9) 


the Laplace transform of Sp which uniquely determines the 
probability distribution of Sp. 

What does (9.8) signify? To appreciate it we need go a 
step farther as follow: take a suitable f and insert f(X(Sp)) 
as a posterior factor on both sides of the relation (9.3), and 
then go through the operations above, which requires no new 
argument. Now put 

Sp 
f e(X)de 
up(x) = E* ¢ e° f(X(Sp)) ? . (9.10) 


Then we obtain exactly the same result (9.8) with uy replacing 
the u there. (Isn’t it odd that the posterior factor has no effect 
on the equation?) With this amplification, we obtain 


f oxo 
u(x) = EB” (e0° uf (X(SB)) ‘ (9.11) 


When y = 0, this reduces to an old acquaintance. Find it. 
Instead of a ball B in the above, we may take any sub- 
domain with its closure contained in D. Then if us € C(D), 
(9.11) will hold even with the B there replaced by D, by the 
same argument in the passage from (8.5) to (8.6). Thus we 
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get a representation of us by its boundary values in a certain 
sense. Observe that in the exponential factor the past of the 
process is carried up to the time of exit from the boundary. It 
is by no means evident that as z goes to the boundary that 
past will leave no “after effect” on the limit. 

We have delayed a vital question here: What is a suitable y 
in (9.7)? To see the seriousness of this question consider y = 1: 
then u(x) = E*{e°°}. Of course by Fubini~Tonelli we have 


x == 1 x ve7 
E*{e"?} =} E*{Sp} 
n=0 


and if D in Green-bounded then E*{Sp} is bounded (by defi- 
nition); but what about those higher powers? Thus even if we 
suppose D and ¢ to be both bounded, we are far from know- 
ing if the u in (9.7) is finite. To stress this point, suppose we 
replace the random time Sp by a constant time T. Then if |y| 
is bounded by M we have trivially 


T 
f e(Xe)at 
E® ¢ © ieee 


But can we substitute Sp for T in the above by making use 
of the distributional properties of Sp? We know for instance 
that P*{Sp > T} decreases exponentially fast as T + co (the 
famous “exponential decay” of radio-activity!), and after all, 
the exponential function in the formula is “simple stuff”. Years 
ago I asked Rick Durrett (who was then a student in my class) 
to investigate the possibilities. He made a valiant attempt to 
concoct a large-deviation connection. It did not work. I was 
informed more recently that a third stage of the deviationary 
tactics was still in the offing. Anyway it is clear that there is a 
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world of difference between a constant J’ and the random Sp, 
exactly the difference between a semigroup and the stochastic 
process, already spoken of in Sec. 4. In fact, except in the banal 
case where y < 0, which was much studied by Mark Kac, we 
know of no viable condition to ensure that u(x) < oo. On the 
other hand, there are necessary and sufficient conditions on D 
and y for this. Sounds odd? Not really because “necessary 
and sufficient” means tautological and therefore may just give 
us a run-around. Instead, I will now give the following viable 
answer. 

Let D be a bounded domain (in R?, d > 1) and let ¢ satisfy 
the condition below due to Aizenman and Simon: 


t 
lim sup | Pelol(aas =0. (9.12) 
t10 zeRd 

0 


Then we have the following all-or-nothing criterion. Either the 
function u in (9.7) is equal to +oo everywhere in D, or it is 
bounded not only in D but in D. This function is called the 
gauge for (D, ). 

Consequence: In case the gauge is bounded, it is continuous 
and > 0 in D, and satisfies the equation 


(Fte)uxo (9.13) 


in the “weak” or distributional sense. Furthermore, the func- 
tion uy in (9.11) also satisfies the equation (9.13); and if D 
is regular then ug € C(D) and it is the unique function hav- 
ing these properties, namely uy is the unique solution of the 
Dirichlet boundary value problem for the new equation (9.13). 
Note that the gauge is just u;. Note also that when ¢ is the 
zero function we are back to our previous occupation. 
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It is trivial that any bounded Borel function satisfies 
the condition (9.12). It is more fun to check that the famous 
Newton—Coulomb function |z|~* also satisfies the condition. 


The equation (9.13) is called Schrédinger’s equation but in 
truth it is only a real fake because he had a factor i = /—1 
in front of the y in (9.13), as well as some physical con- 
stant (Planck’s?) which makes it “purely imaginary” — called 
“quantum”. Probability theory still lives in the world of reals 
and does not know how to handle that imaginary stuff. In 1949 
I heard a talk by Richard Feynman”? on this topic, and the 

t 
f e(Xs)ds 
exponential e° is now commonly known as Feynman’s 
“path integral”, which is a double misnomer because firstly he 
(like Schrédinger) had an i = \/—1 there; secondly, what he 
called path integral is nothing but our elementary E = P over 
the path space {w € 2: X(w,-)}, not the true path-by-path 
t 


integral [ y(X5(w))ds. By the way, he considered only con- 
0 


stant t there but quite possibly would not mind randomizing 
it when his calculations warranted it. 

The story is too long to tell here. Suffice it to say that 
for a “gaugeable” pair (D,y) a probabilistic theory has been 
developed for the Schrodinger extension of the Laplacian 
ideenkreis begun in Sec. 1 of this book. A short exposition 
has been given in Sec. 4.7 of [L]; a fuller one in [F] which has 
just been published. 

With the enunciation of the strong Markov property we 
have completed the gradual unveiling of the character of the 
Brownian Motion process. It is a strong Markov Process with 
the transition density function given by (2.1) in the Euclidean 
space R%, and with continuous sample paths. The “strong” 
assertion actually follows from the rest of the characterization. 
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The older characterization embodied in (5.18) can be shown to 
be an equivalent one owing to the spatial homogeneity of the 
transition density; see Sec. 4.1, Theorem 4 of [L]. 


10. TRANSIENCE 


The BMP in R? has a property radically different from 
that in R+ or R?. It has been given the name “transience” or 
“non-recurrence” but it can be more simply described by: 


P { lim |X(t)| = +00} =1, (10.1) 


This result has a famous discrete analogue originating in 
Pélya’s Bernoullian random walk (1921), and extended to a 
general random walk, see [C; Sec. 8.3]. The latter implies at 
once the following property of the BMP in R%, d > 1. For each 
r >0 and 6 > 0, we have 


P{|X(nd)| >r for all sufficiently large n} 
0, ifd=1 or 2; 
~ { 1, ifd>3. 

For d = 1 or 2, this is a stronger assertion than 

P{|X(t)| >r for all sufficiently large t} = 0 ; 
while for d > 3, it is a weaker result than 

P{|X(t)|>r for all sufficiently large t} =1. 
Thus (10.1) does not follow from the discrete analogue 


V6 >0: Pf{ lim |X(nd)| + oo} =1, (10.2) 
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which can of course be immediately strengthened to a count- 
able set of 6 such as the rational numbers. One would expect 
to be able to squeeze out from the latter a proof of (10.1) 
by approximation and the continuity of X(-). Surprisingly, 
there is no easy way. First of all, it is “easy” to construct a 
continuous f from Rp to R* such that Jim, f (nq) = +00 for 


every rational g, while lim f(t) = 0.21 Therefore some special 


property of the BMP ig. indispensable to get from (10.2) to 
(10.1). But how much do we need? If the strong Markov 
property is used, this can be done by reductio ad absurdum, 
using (7.15) with r = 1, and a form of the Borel—Cantelli 
lemma similar to Theorem 9.5.2 of [C], which Doeblin called 
the “street crossing lemma”. This argument requires only 
Jim. |X (n)| = +00, viz. 6 = 1 in (10.2). 

In general, it would be foolish to treat the BMP by approx- 
imating it with discrete skeleton — a remark made already by 
Bachelier who initiated the continuous time model (for stock 
market “speculation”). Let us therefore go to prove (10.1) 
by the new methodology. We begin by calculating the prob- 
ability of hitting one set before another. Let 0 < a < b < co; 
Ba = B(0,a), By = B(0,b), D = By\Ba. Then D is a domain 
with 0D = (OB,) U (OB,); it is regular because spheres are as 
smooth as they come and satisfy the cone condition, although 
we have not proved this. Define f to be equal to one on OB, 
and equal to zero on OB,; this is a continuous function on OD. 
Hence we know from Sec. 7 that the following function h is 
harmonic in D: 


h(x) = E*{f(X(Tpe))} = P*{Top, < Toz,} (10.3) 


where 7’, is defined in (4.2). In fact, h is the unique solution of 
the Dirichlet problem (D, f). It is geometrically obvious that 
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h depends only on |z| (not on the angle). Now the Laplace 


equation (1.2) for a function of r = \/x? + «2 + x2 takes the 
form 

d*h  2dh _ 

dr?" rdr 
This ordinary linear differential equation is easy to solve and 
has the general solution © + cz where c; and cz are constants. 
By (10.3), and the regularity of D, we have h(r) = 1 or 0 for 
r=aor b. It follows that 


(10.4) 


ab a 


Be) eal oa 


a< |x| <b. (10.5) 


Letting b t co, so that Taz, TF co, we obtain 


P*{Tap, < 00} = ia a<|z|<oo. (10.6) 


This formula can be extended to all x if we truncate the right- 
hand quantity by 1; and Tag, can be replaced by Tg, or Tp,. 
Now for a general set A, put 


ya(x) = P* {Ta < co}. (10.7) 
We have then 
Pipa = E*{pa(X;)} = E*{P**[T4 < o0]} 
= P* {As € (t,o): X, € A}; (10.8) 
and consequently by (4.3): 
Jim Piya(z) = P?{L4 = +00}. (10.9) 


Whether it is by nature’s grand design or by pure chance, 
the mathematics required here will lead us straight back to 
the original source in Sec. 1 and Sec. 2. Recall first the 
Newton-Coulomb potential uo(z) = |z|~* in (1.3), so that 
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the right member of (10.6) is auo; while the left member has 
been re-denoted by yg (x). Next recall P, from (2.5), and the 
quintessential representation of uo in (2.3). We have then 


CO 


Pruo(2) = f p(ts2,4) [r(siu,0)as dy 
R3 0 
= / , p(t; z, y)p(s; y, 0)dy| ds 
0 3 


= [ole 5:2, 0)as = [rle2,0)as : 
J (10.10) 


Of course, it is Fubini—Tonelli that is used first, then the con- 
volution property of the normal density, which makes {P;} a 
semigroup — every step is fundamental! Letting t + oo we 
obtain by (10.9) with A = B,: 


P*{Lg,=co}=0, ceER?. (10.11) 


Since any bounded set A in R® is a subset of B, for some 
a, we may replace B, in (10.11) by A. But if A is not Borel 
measurable, not even an analytic set, D4 may not be a random 
variable, how can we have P*{I4 = co}? Well, our probability 
space is complete (see (C; p. 383ff.]); if A C B, then Ly < Lz.,, 
{La = co} C {Lg, = co}. By completeness, when (10.11) 
is true, then P*{L4 = oo} is defined and is also equal to 0. 
Therefore, it is logical to speak of the transience of an arbitrary 
bounded set A. 

There is another big consequence of (10.6). For any x 4 0, 
ifa|0 then Tap, t Tio}, hence 


P®{T io} < co} =0. (10.12) 
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This result remains true for x = 0, an excellent little exercise. 
It has been announced before in (7.3): each singleton is a polar 
set. This result is also true in R?, but totally false in R!. The 
former assertion can be proved in a similar way as in R? shown 
above. The latter is “easy to see”, as Laplace used to say in 
his book dedicated to Napoleon. 
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11. LAST BUT NOT LEAST 


Let us now fix a compact set A and consider L4. So far we 
have ignored its measurability which is tied to that of T4, as 
follows. For each s > 0, define 


T4 =inf{s<t<oo: X,¢ A}. (11.1) 


A similar random time has already been used in (7.7). This 
T§ is the T'4 for the post-s process {X?} = {X.4+} introduced 
in Sec. 3. We have seen in Sec. 4 that for a closed A, T'4 is 
optional and so F..-measurable. Hence 74 is F5,-measurable, 
where F8, = o(Xi,8 < t < oo). Since FS, C Fo, T§ isa 
random variable (allowing +oo as value). Now the definition 
of L implies that for each s > 0, we have 


{ba Ssh Stly <00} (11.2) 
It follows firstly that L4 is ¥,.-measurable; secondly that 
{L4 > 0} = {Ta < 00} ; (11.3) 
thirdly that 
{0<L4 <s}={T4 < wo} —{T4 < oo}; (11.4) 


and fourthly for each z: 
P*{0 < La < 8} = P*{T4 < co} — P*{P**[T, < o]} 
= ya(z) — Pspa(z) (11.5) 
where ya is already introduced in (10.7). 
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The function y,4 is harmonic in A°. To see this let x € A‘ 


and B(x,r) € A°. Then using an old argument in Sec. 6, and 
writing OB for OB(z,r): 


pa(z) = Pel Ta < oo} = P* {Top <Ta < oo} 


= P*{P* (Te) 'T, < coj} = Hpya(s) . 
(11.6) 


Namely y has Gauss’s sphere-averaging property, and being 
bounded, is harmonic in A®. What can we say about y,4(z) 
for « € A? If A= D where D is a bounded domain then it is 
trivial that g(x) = 1 for x € D; if (D)° is regular as defined 
in (7.2), namely if E*{T,5} = 0 for x € OD, the same is true 
for x € D. 

It follows that each x € A‘, there is a neighborhood of z in 
which y, belongs to Cw , hence by a theorem on p. 190 of [L}, 


lim 742) — Pspalz) _ A 
s—0 s Z 


ya(x) =0. (11.7) 


In the case just mentioned, where yy is the constant 1 in A° = 
D (A° is the interior of A), the same conclusion holds if x € A®. 
Thus when A = D, the limit in (11.7) is 0 except possibly for 
£e.0OD. 

Previous Secs. 5 to 9 are all about Tp- and X(Tp¢). 
“Cherchez la symétrie”! So let us now find out about Lj and 
X(Lp). As before we write A = D for a bounded domain D, 
to “fix the ideas” as they say, but remark that in much of the 
sequel A may be an arbitrary compact set in R?. We proved in 
Sec. 10 that D4 < oo (a.s.). Is there any chance to say more? 
It follows from (11.2) that 
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E*{La} = i P*{L.4 > t}dt = / Pbacid 
0 0 


= Gya(x) . 


If A = B(0,a), then by (10.6), we have ya(x) = ia] \ 1 and so 
the above is equal to 


J amema (wi) 


R3 


Thus £7{L,4} = oo. Contrast this with (5.12): sup B7{T4-} < 


coo. Indeed the latter can be further strengthened to 
sup B® {exp(bT'4-)} < 00 for some b > 0. 


Let us turn to X(L4), and put 
Laf(x) = E*{La > 0; f(X(La))} - (11.8) 


Just as X(Tp-) € OD by continuity of the Brownian paths, so 
we have X(L.4) € 0A= OD when A = D. Thus it is sufficient 
in (11.8) to consider f € C(0A) to determine the distribution of 
the last exit place. However, we shall need to use f € C.(R?) 
below. [There is an old theorem by Lebesgue asserting that 
any function given on a compact set can be extended to a 
continuous function on the whole space.| Anyway, to “get hold 
of” the random variable X(L4) we will “function it” and then 
approximate from the left?? as follows, writing L for L,: 


L 
f(X1) = lim 5 / H(Xe)ds , (11.9) 


(L—6)* 


where (L—6)* = (L—6) V0. Now re-write the integral above as 
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F(Xs)ltscr<s+syas (11.10) 


On, 8 


so that we are in position to play the Fubini—Tonelli game. If 
we now take expectation and change the order of integration, 
we obtain 


[FUCK ecresiads} 2 (11.11) 
0 


See how the vital but tedious proviso “Z > 0” in (11.8) has 
bowed out automatically? We are ready to apply the Markov 
property for further revelation. Using (11.6) we obtain from 
(11.11): 


[¥ { f(Xs)E**(0 < L < 5)}ds 
0 
= | BK (ea— Prpa)(X.)}ds 
0 


=") [ f(va—Papa)(X.)ds 
0 


= G(f(yva — Pspa))(2) , (11.12) 


a small miracle of symbolism bringing us back to (3.3). Since 
f has bounded support and 0 < vy, < 1, we have 


G(\f| lya — Psval) < 00, 


in fact bounded by an old exercise in Sec. 2. Therefore, the 
calculations above are correct by Fubini-Tonelli, and upon 
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retracing the steps we see that we have proved the bewildering 
result below: 


eine pa ~ Pepa 
Laf =lime (J Facies ) (11.13) 


Actually we have also interchanged the order of the indicated 
limit in 6 with E*, which must be justified by bounded con- 
vergence, namely: 


L 
5 fh H%)ds| < Itt. 
(L—6)+ 


Put for all Borel sets B in R?: 


p5(B) = | jlealv) ~ Po alo)lay (11.14) 
B 


Then we can re-state (11.13) as follows, for f € C.(R%) and 
z € R3; 
fw) 


rar menu = Laf(z). (11.15) 
R3 


From this key relation we can deduce, without difficulty, the 
conclusions below. 


(a) As 6 | 0, the measure zs converges vaguely to a measure 
ta defined as follows: for any x € R?: 


wa(dy) = 2n|z — y|La(z, dy) (11.16) 


where the measure L(x, dy) is that indicated (4 la Bour- 
baki) by (11.8), with support in 0A. That jz4 has support 
OA has also been confirmed by the remark following (11.7). 
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(b) The measure j14 is atomless, namely .4({y}) = 0 for each 
ye R. 
(c) Turning the table around, we have for each z € R?: 


d 


La(z, {x}) =0. 


The ultimate equation above is worthy of comment. While 
La(a, {y}) = 0 for y ¥ z follows from (b), that single excluded 
case seems to require (10.2) in the form P*{T;,} < oo} = 0, 
which was left dangling there for a tease. 


12. LEAST ENERGY 


Summing up the results in the preceding section, we have 
for each compact subset A in R°, a measure 4 supported 
by OA with the following property. For each f € C(OA) and 
x é R?: 


E*{La >0; f(X(La))} = / a(ty)Ff(y)ua(dy) (12.1) 
OA 


where g is given in (2.3). Introduce the notation 


Gu(2) = " g(x, y)u(dy) (12.2) 


for any measure v, as a natural extension of Gf for a function 
f. Putting f = 1 in (12.1) and recalling (11.4), we obtain the 
celebrated relation 


P* {Ta < co} = Gua(z) . (12.3) 


The probability on the left side above is of course equal to 1 
for z € A®; it is also equal to 1 for x in 0A = O(A‘) which 
is regular as a boundary point of the open set A°, defined in 
(7.1). We will assume this below for all « € GA so that (12.3) 
includes 

VeaeA: Gua(r)=1. (12.4) 
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The measure 4 is known as the “equilibrium measure” 
for A, and has the electrostatic origin as nature’s manner of 
distributing a charge placed on the conductor A. The total 
charge is of course 44(A) which will be re-denoted by j14(1), 
conformable with the general notation wa(f) = f fdua for a 
function f. The potential function produced by this charge, 
under the Newton—Coulomb law, is clearly given by Gua, 
as a weighted average of g(-,y) with y distributed according 
to MA. 

This is constant throughout the conductor A, where the 
maximum potential is achieved, as seen from (12.3). Electro- 
statically speaking, the constant 1 in (12.4) amounts to a unit- 
taking. In this sense the total charge w4(1) is called the 
“capacity” of the conductor A and denoted by C(A). This 
is the maximum charge that the conductor A can support so 
that the resultant potential everywhere in space not exceed one 
unit. Hence the name “capacity”, but I do not know how Sir 
Isaac got it before electricity was invented. 


It is possible to extend this notion of capacity to all 
“capacitable” sets. As mentioned earlier in Sec. 4, Hunt used 
such capacities to prove that for all analytic, hence all Borel 
sets A, the random variable T'4 is an optional time. For the 
present mini discussion let us just note that C(A) = C(0A) 
although A may be 3-dimensional and 0A 2-dimensional. It is 
easy to deduce from (12.3) that for the solid ball A = B(0,r) 
or its surface, we have C(A) = 2mr. This can also be de- 
duced from the less informative relation (12.4) if we take z = 0 
there.?8 In fact, the rotational symmetry of the BMP implies 
that the distribution L4(0,-) should be uniform over 0A; hence 


by (11.16): 
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balay) = 2er 70) = Sola) 
pa(OA) = 50(0A) = 2s 


The existence of the equilibrium measure is a physical phe- 
nomenon, just as the induced charge distribution on Green’s 
conductor recounted in Sec. 1. As shown in (6.3) and (6.5), 
the latter, the harmonic measure Hp(z,-), is nothing but the 
probability distribution of X(Sp) produced by the Brownian 
paths at their first exit from a bounded domain D. Here 
the distribution of X(Lpj) produced by the same paths at 
their last exit from D yields the equilibrium measure for D 
through (11.16) and (11.17). Indeed, the last exit distribution 
Lp(z,-) tells a lot more than pp, owing to the additional vari- 
able x there. Both measures Hp and Ly are supported by 
OD, namely the electrical charges stay on the surface, because 
Brownian paths are continuous! 

Historically, it is curious that it took professional proba- 
bilists nearly three decades to take the little leap from a first 
exit (Kakutani 1944) to a last exit (1973).24 To quote from 
[Chung P]: 


For some reason the notion of a last exit time, which 
is manifestly involved in the arguments, would not be 
dealt with openly and directly. This may be partially 
due to the fact that such a time is not an “optional” 
(or “stopping” ) time, does not belong to the standard 
equipment, and so must be evaded at all costs. — “Like 
the devil”, as I added in a postcard to Henry McKean. 


To wind up the story in a classic mood (mode), I will tell 
a little about what Gauss did. It is a physical principle that 
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Nature takes the least action and spends the least energy. The 
energy of two charges of amounts (positive or negative) ; and 
Ag at the points 2; and 22 is given by the Newton-Coulomb 


law to be 
Ai A2 


|z1 — 9] 
A fantastic generalization (the circumspect Kellogg [p. 80] cau- 
tions about its validity) is as follows. For any two signed finite 
measures A; and Ag, their mutual energy is defined by 


one / Ar (dz)g(«,y)d2(dy) (12.5) 


provided the integral exists in the usual sense, after the decom- 
position into four parts using Hahn—Jordan’s decomposition: 
A = At — \~. We say X has finite energy iff (|A|,|A|) < co. 
It is a remarkable result that in the latter case (A,A) > 0, 
and moreover (A,A) = 0 only if A is the zero measure. See 
[L; Sec. 5.2] for proofs, but they are not necessary for an 
appreciation of the rest of this discussion. 

Gauss is known to be a terse person (although he gave 
seven proofs to a favorite theorem of his), and so used a more 
economic form of the energy above, to be denoted by 


Q(A) = (A, A) — 2A(1) . (12.6) 


For any A, we have by Fubini—Tonelli: 


(ua) =f *(d2) f oe,y)ualdy) =A*(1). (12.7) 


In particular 
(HA, MA) = MA(1) < 00. (12.8) 
Moreover if » has finite energy then so does \ — pg. 


Now consider an arbitrary signed finite measure \ having 
finite energy, and put A — “4 = v. We have then 
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QO) = (ua + ¥, Ha + Vv) — 2(ua +v)(1) 
= (wa, Ma) + 2(ua,v) + (vv) — 2ua(1) — 2v(1) 
= Q(ua) + (v,v) 


by using (12.7) with v for A+. Therefore, Q(A) > Q(pa) 
(= —ua(1)). Namely Q is minimized by 44 among all the 
A considered. This is Gauss’s method of determining the equi- 
librium measure as achieving minimum energy according to 
Nature’s predilection. However, he forgot to demonstrate the 
existence of such a minimizing measure, which lacuna was 
not filled until 1935 by Frostman. The same lacuna led to 
Dirichlet’s Principle. As we saw here the Brownian motion 
paths know how to produce the equilibrium measure without 
bothering with energy. Nature’s Ways are manifold. Basta 
Vultima uscita! 
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Addenda 


(I). Convergence at boundary 

In the discussion of (7.5), I lamented the apparent lack of 
an expedient mode of convergence. Soon after the first edition 
of the book appeared, Giorgio Letta showed me such a mode by 
a brilliant detour. It is based on the fundamental theorem: a 
sequence of random variables convergent in measure contains a 
subsequence convergent almost surely [C; Third Edition; The- 
orem 4.2.3]. 

Following Letta, we begin by changing our notation for the 
Brownian motion process by specifying X(0,w) = 0 for all 
w € 2, namely that all paths start at the origin in R?, so 
that «+ X(.) is the BMP starting at z € R¢. Now define the 
optional time, for a domain D: 


S(x) = inf{t >0:x2+4+ X(t) € D*}. 


The distribution of S(x) under P® is that of the previous Sp 
in (5.1) under P®. We write P for P° and E for E® below. 
If  € D then x + X(S(zx)) € OD. In our new notation, (7.6) 


becomes 
lim, ,,E{S(x)} = 0. 


Thus the random variable S(z) converges to 0 in L1(P), a 
fortiori in P-measure. Hence any sequence x, convergent to z 
contains a subsequence y,, such that S(y,,) converges to 0 a.s. 


89 


90 Green, Brown, and Probability 


The latter “strong” mode of convergence carries with it the 
a.s. convergence of X(S(yn)) to X(0), because t + X(t, w) is 
continuous for all or almost all w. Then of course yn,+X (S(yn)) 
converges as. to z+ X(0) = z. It follows by Lebesgue’s 
bounded convergence theorem that for any bounded continuous 
function f on OD we have 


limnE{ f (yn + X(S(un))} = F(z): 


For the sake of clarity, put c(z) = E{f(a+X(S(z))}. Looking 
back we see that we have proved: any sequence c(%,), where 
Xn — zZ, contains a subsequence which converges to f(z). This 
implies c(xz) + f(z) as (x — z, by a basic property of real num- 
bers which we will call the “Sotto-Sotto” Lemma; See [C; p.89]. 
In our previous notation c(x) = E*{f(X(Sp))}; thus (7.16) is 
proved. 

Exercise for the learner: Prove that X(S(x)) converges in 
P-measure to 0 as x — z. Beware: there are two hidden w 
there, see warning on p. 44. 


(II). Time arrow and Green Symmetry 

In Note 24 and elsewhere, reversing the time is hinted by 
“first entrance” versus “last exit”. There is a junior case of 
this: in a fixed interval of (0, ¢] reversing X(s) to X(t — s) for 
all s. Let 

Y le dgP*; 
Rd 

which means assigning the nonprobability Lebesgue measure 
to X(0) as initial “distribution”. Pronto we have, for any two 
Borel sets A and B, 


P*{X(0) € A; X(t) € B} = P*{X(t) € A; X(0) € B}.  (*) 
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This is proved by explicit calculation based on the symme- 
try of the transition density p(.;z,y). When A = R?@, we see 
that X(t) has the Lebesgue distribution for all t, namely the 
Lebesgue measure is invariant under the semigroup P;. Now 
consider a cylinder set as in (6.7): 


nm 
A= (]{X(tj) € Aj} 
j=l 
where 0 = to < t) < tg < +--+ < th = t and the A,’s are 
Borel sets with Ap = A, A, = B. Denote A the “reversed” 
set in which each t; becomes t — tj. Then the relation (*) 
may be enhanced by inserting A and A respectively into the 
left and right members. The proof is easy once the intuitive 
meaning of the symmetry of the transition is grasped, whereas 
an explicit check by multiple integral (as shown on p.184 of [L] 
and p.35 of [F]) may be hard to read! Next, let the relation 
be steadily refined by increasing the set of partition points t; 
to a denumerable dense set in (0,t). Then {t — t;} is likewise 
dense. Now let D be a compact set and take all the A; to be 
D, except A and B to be arbitrary Borel subsets of D. The 
continuity of X(.) implies then 
P*{X(0) € A; X(s) € D for0<s<t; X(t) € B} 
= P*{X(t) € A; X(s) € D for0< s < t; X(0) € B}. 

Finally let D be an open set and let D,, be compacts increasing 
strictly to D. Replacing D by D, in the above and letting 
n —> oo, we conclude that the preceding relation remains true 


for an open D, and now (why only now?) may be recorded 
using our previous notation in (5.1) as 


P*{X(0) € A;t < Sp; X(t) € B} 
= P*{X(0) € B;t < Sp; X(t) € A}. (**) 


92 Green, Brown, and Probability 
Introduce the kernel 
PP (x, A) = P*{t < Sp; X(t) € A} 


so that the Green potential in (6.12) may be written as 
co 
Gp =| PP dt. 
) 


In the new symbolism (**) becomes 


[ae-PP (eB) = [ de.PP (eA) (#**) 


Integrating over t and invoking Fubini we obtain 


| 42GB) = | dz.Gp(z, A). 
A B 


But we know Gp has a nice density that is Green’s (sic) func- 
tion, given in (8.9) and is obviously (extended) continuous. 
Hence the above reduces to the double integrals of Green’s 
function gp(x,y) over A and B in one way and over B and A 
in the other. Anybody can deduce from this that it is symme- 
tric in (a, y). 

The above is due to Z.Q. Chen to whom I proposed the 
“problem”. In fact, the relation (***) would imply the symme- 
try of Gil Hunt’s “relative” (his terminology) transition density 
p?(.;2,y), if only we knew the existence and continuity of the 
latter, as Einstein found out by solving the heat equation in 
the non-relative case of a “free” Brownian motion, see Sec. 3. 
In Hunt’s proof, reproduced faithfully in [F; 2.2], all is done 
together but the work is rather tedious. A lot of TIME has 
flown by; is there now an easier way? Merely solving Einstein’s 
equation with a boundary constraint won’t do (why?). 
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It is legend that the time arrow is reversible in the great 
equations of physics from Newton to Maxwell. It is highly plau- 
sible that this has to do with certain “duality” phenomenon in 
classical potential theory. But Hunt has to impose extra math- 
ematical assumptions in dealing with them in his “Hunt pro- 
cess” (1957-8), later extended by Blumenthal and Getoor, and 
others. Since then a general theory of reversing Markov process 
has been constructed by Chung and Walsh (see [T] and sub- 
sequent publications), yet regretably few concrete, physically 
significant applications seem to have resulted. It is partly in 
the hope of stimulating further progress in turning the arrow 
that this Addendum (II) is given here. 
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NOTES 


(1) All the mathematics in this section can be read in 
Green’s Essay cited in the References. The identity (1.1) here 
is his equation (2). The representation (1.9) is his equation (6), 
of which he said: “This equation is remarkable on account of 
its simplicity and singularity, seeing that it gives the value 
of the potential for any point p’ within the surface, when its 
value at the surface itself is known, together with the density 
that a unit of electricity concentrated in p’ would induce on 
this surface, if it conducted electricity perfectly, and were put 
in communication with the earth.” [Two dispensable symbols 
have been deleted from the original text, on p. 32.] 


(2) In the symbolism of Laurent Schwartz’s “distribution” , 
the calculations below amount to proving that 


A (a) =—4nd9 in R® 


where do is the Dirac function. This corresponds to Green’s 
equation (1) [numero uno] on p. 22. When I mentioned it to 
Laurent, he said he had not known. 


(3) Did any of the ancient greats (perhaps Poincaré, who 
knew both potential and probability) suspect a true bond 
between Newton—Coulomb and DeMoivre-—Laplace—Gauss? 
Could it be Euler who first evaluated the so-called infinite or 
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improper integral in (2.3)? I asked the late André Weil, who 
answered that he knew only Euler’s number theory, but the 
question seemed interesting to him. 


(4) In my book [E] I told the story of my rencontre with 
Dr. Einstein on Mercer Street, and his view of the foundation 
of probability. He also questioned me on the civil war in China, 
and opined that the United States should not get involved 
in it. 

(5) Paul Lévy in his book [P] attributed to L. Bachelier’s 
Thése: Théorie de la spéculation (1900) as the first mathe- 
matical work on Brownian motion by probabilistic methods. 
Feller in his book (Volume 2) even called it the “Bachelier— 
Wiener” process. Bachelier made a passionate plea for prob- 
abilities in continuous time and with absolutely continuous 
(distribution) functions, no approximations! As an application 
he mentioned (p. 333ff.) the analytic theory of heat, wrote 
down the normal density as solution to Fourier’s (heat) equa- 
tion. There is no question that his “new method” (title of one 
of his books), for which he claimed sole credit, was applicable 
to Brownian motion which should be known to him through 
(the French) Jean Perrin if not (the foreigner) Albert Einstein. 
Indeed, the movement of material particles due to hasard was 
treated in two chapters (XX, XXI) under the titles of “kinetic” 
and “dynamic” probabilities, although his principal model was 
the stock market. [His master Poincaré, whose patronage he 
acknowledged, was known to frequent roulette tables.] But 
after spending many hours leafing through his Thése and his 
grand Calcul, tome 1 (is there a tome 2?), and three paper- 
back monographs dated 1937, 1938 and 1939 which I bought 
I do not remember when, I was unable to spot anywhere the 
name Brown (or brownien). However, Lévy stated [Q; p. 123] 
that Bachelier published a new book on Brownian motion just 
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before the war “of 1939-1945”. [For me the war was at least of 
1937-1945.} I have not found that book and have never seen it 
mentioned by anybody else, but I have written to Michel Weil 
at Besancon, where Bachelier had his post, to find out. Enfin, 
rien. 

Personally I owe a great debt to Bachelier for a combina- 
torial formula in his Calcul, dealing with the Bernoullian case 
of max, |S,| in my thesis (1947). How did I know to look into 


that formidable volume on the fourth floor of the old Fine Hall 
and to find what I wanted, is a case of la chance. I looked at. it 
again last night on pp. 252-253, but could no longer decipher 
it. [April 10, 1995.] 

(6) Lévy [Q; p. 97]: “Since I have decided to conceal nothing 
of my own feelings, the moment has arrived to say that, among 
the opportunities that I had not been able to seize, that of 
leaving it to Wiener the discovery of this function X (t), despite 
the fact that existence of precursor diminishes its importance, 
was one of those that I regretted most.” The fairly literal 
translation is mine. Apropos I should also quote half a sen- 
tence from Wiener [D]: “The present paper owes its inception 
to a conversation which the author had with Professor Lévy in 
regard to...”. 

(7) Years ago, a then young probabilist submitted a paper 
in which he announced, more or less, that the notion of a 
separable process was obsolete. This reminds me of many 
youngsters of today who had never dipped a pen in ink. In 
contemporary culture, such a fountain pen (called also ink-pen) 
beautifully crafted and gilded, is still in vogue as a gift item. 

(8) In 195?, at a seminar in Columbia University, I showed 
something using the Markov property. A colleague Areyh 
Dvoretzky stood up after my talk, went to the blackboard and 
gave a shorter argument based on the mis-use of the property 
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as indicated above. Of course he admitted the faux-pas on 
the spot and no harm resulted. According to Feller, the same 
mistake occurred in a popular expository article by a renowned 
scientist, but since I was unable to retrace it in print, it is better 
not to spread the gossip. 


(9) This little exercise was assigned to the audience (a kind 
of summer institute) in Dalian, China, 1991. Later I asked the 
local instructor to spread it among students and staff. I regret 
to report that after two years nobody had done the problem. 
Is it really that hard? 


(10) I consulted Jean-Claude Zambrini on Einstein’s text. 
He responded: “Einstein needed the Markovian character of his 
Brownian particle and not the independence of increments over 
intervals 7 in a modern probabilistic sense.” A more “realistic” 
model has been proposed by L. S. Ornstein (already mentioned 
by Firth) and Uhlenbeck, but that is another story. 


(11) Green used “potential function”, Gauss used “poten- 
tial”; see Kellogg [p. 52]. 


(12) I owe O. Kallenberg this attribution; previously I 
attributed it to E. Dynkin. 


(13) The bounded measurability of Hpf is essential here. 
Suppose h is merely finite and integrable over every sphere in 
R¢ (d > 1) and the sphere-averaging property holds for all z 
and all B(z,r). Is then h harmonic in R4? Nobody seems 
to know the answer but I should conjecture “no”, except for 
d = 1 where the answer is known to be yes. Local integrability, 
namely the integrability of h over every ball, is a sufficient sup- 
plementary condition for h to be harmonic. This is a wonderful 
case of the non-use and mis-use of the Fubini—Tonelli theorem 
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reviewed in Sec. 2 above. In an early draft of [F], Z. Zhao 
made the mistake of deriving (6.14) from (6.13) “simply by 
integrating”. My treatment in [L; pp. 154-6] is more cautious, 
but some just don’t get it. 


(14) In general Markov processes, the point x is defined to 
be regular for the set A iff H*{T74} = 0. Thus here z is regular 
for D°. 


(15) In F. Knight’s book Brownian Motion And Diffusion 
(1981), the proof of this assertion is by these words: “By the 
boundedness and continuity of f” (p. 67). He might as well 
have said, “Since two and two is four”. J know Frank well and 
he is an honorable man. Was he aware of the little difficulty 
there or was he merely pretending to be concise? I propose 
to any instructor of this stuff to set a quiz for his class to 
obtain an opinion poll on the problem. Since the editor of the 
Mathematical Surveys originally wanted me to review Frank’s 
manuscript but I declined, this is a belated comment for the 
edification of future authors. 


(16) He proved that in his differential space the set of x(t) 
in the sphere of radius r and satisfying the Hélder condition 


1 
|x (te) _ x(t1)| > ar(te _ iy) oo 0<e< 3 ; 
“for rational t, be it understood” (he warned), has an outer 
measure in Daniell’s sense which goes to zero as a goes to 
infinity. 

(17) I consulted a physicist (gravity expert) on this ques- 
tion, and was referred to Maxwell’s book. Later I found out 
from the Appendix to [Green, p. 327] the following statement. 
“The important theorem of reciprocity, established in Art. 6, 
may be put in a clearer light by the following demonstration, 
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which is due to Professor Maxwell.” It seems a little odd (to 
me) that neither he nor Maxwell used the argument indicated 
here. 

I should add that Hunt in his 1958 paper has gone deeper 
into this symmetry. He constructed the transition density of 
the Brownian motion “killed off D”, denoted by p?(t;z,y), so 
that for any Borel set A: 


[Pia.aay = P*{t< Sp; X,€ A}. 
A 


Then the Green function for D can be defined simply by 


foe) 


gp(z,y) = [ere x, y)dt 
0 


just as g(x,y) can be defined by (2.3). He proved that for 
each t > 0, p?(t;z,y) is symmetric in (x,y) € D x D. The 
symmetry of gp(x,y) becomes a trivial consequence. By the 
way, there is NO probabilistic intuition for the symmetry of 
p? (t;,-,-) although Hunt tried to make it appear a little more 
plausible in his argument. In [F] we give a straightforward 
computational verification. 


(18) An apparently more intuitive way to define the pre- 
T tribe is the tribe generated by the stopped process X(t A 
T), t € Ro. But this is inadequate because the random time 
T does not belong to it, as pointed out in [O]. In drafting 
this manuscript I found in Hunt’s paper [p. 299] that he was 
quite aware of this, unlike another author who had adopted 
the defective definition without reflection. The present simple 
definition is due to A. Yushkevich. There is another longer but 
pleasant alternative: for a completely arbitrary random time T 
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we can define a strict-pre-7' tribe denoted by F7_, then define 
Fr+ to be A F(r41)-. What could be more natural? Now 


if T is éoeonal mirabile dictu, the latter is exactly the Fri 
defined in the text. I’ll let you figure out how to define that 
Fr— but you may consult [L; Sec. 1.3 and Exercise 4]. By the 
way, to appreciate a “trivial thing” like Fp, read Doob’s 1953 
book to see how he messed it up there. 


(19) Regarding the origin of the strong Markov property, 
Hunt said [p. 294]: “Although mathematicians use this ex- 
tended Markov property, at least as a heuristic principle, I have 
nowhere found it discussed with rigor. We begin by proving 
a precise version for Brownian motion ... . I have not pushed 
the scope of the proof to the limit because it requires conti- 
nuity of the sample functions on the right and it is probable* 
that a version of the extended Markoff property holds for pro- 
cesses which are only separable.” In the case of Markov chains, 
namely on a countable state space, the said property is proved 
by me using the version of the process whose sample functions 
are lower-semi-continuous on the right, which is deduced from 
their separability |M; Sec. II.9]. In general state space with 
right continuous sample function and a Feller condition, the 
property was proved by R. M. Blumenthal and A. Yushkevich, 
independently; see e.g., [L; Sec. 2.3]. The historical account 
given in a recent pamphlet is inaccurate, according to the last- 
named author, but I have not seen it. 


(20) As many popular books by and about Richard Feyn- 
man have appeared, allow me to add an anecdote here which 
I had told at a conference at Cornell University in 1993. One 
summer night, “when we were all young”, I met Richard at 


*Gil’s surmise is wrong. 
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the popular night-spot College Spa on State Street in Ithaca, 
now defunct. One thing or another made me propose a little 
mathematical quiz to him: let an arbitrary triangle be given, 
mark a third length on each side and draw a line segment 
joining the division point to the opposite vertex. Lo and behold 
a smaller triangle appears in the middle, being one of the seven 
into which original triangle has been partitioned. Prove that 
its area is exactly one seventh of the original! Richard 
found this unbelievable, took out his pen or pencil, started 
computing on a napkin grabbed from the table, and after a 
couple of minutes announced that it must be wrong because 
his approximation did not support it. I bet him a quarter, and 
he re-did his computations. Eventually he paid up gamely after 
I told him to check it for an equilateral triangle.* My old friend 
Carlo Riparbelli (an aeronautical engineer) was with us, and 
many years later sent me two different proofs on large drafts- 
man’s sheets. Shortly before Feynman’s death I telephoned 
him (to my surprise he had a listed number) to ask his per- 
mission to quote him on the “eternal futility” (my words, but 
he said so in substance) not only of high mathematics but also 
high physics and astronomy, spoken by him on the Nova pro- 
gram and available in print (“The pleasure of finding things 
out”, January 25, 1983). My article containing his remarks 
was published in Singapore, 1984, after I gave a talk there. 
Later, the late Mr. Shi Héng-gao, who was my English teacher 
in high school and held ministerial positions for Mao’s China 
in the fifties-sixties, translated it into Chinese because, as he 
told me, it should serve as a “cooling medicine” for the current 


*Such a quick solution by projection was found by my other friend Armin 
Perry, who took my Advanced Calculus at Cornell and later worked at 
General Electric at Syracuse when I told him the problem. He did it 
without any hint. 
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Chinese science scene. Despite his good connections his manu- 
script was refused publication as being counter-something-or- 
other, until 1988 shortly before the Tiananmen tragedy (Nature 
Journal, vol. 11, no. 9, 1988, Shanghai). The editor Mr. Zhang 
was dismissed after the tragedy as too “liberal” — I heard. 
Even Feynman, in the committee investigating the fatally 
exploded rocket (1986), had to buck the politicians and bureau- 
crats to present his testimony with a glass of (icy?) water to 
the U.S. congress and television audience. It was a marvellous 
performance. 


(21) After I proposed this little problem at the Fifteenth 
Seminar On Stochastic Processes held at the University of 
Florida in Gainesville, March 9-11, 1995 and got no response, 
J asked my colleague Y. Katznelson. He wrote down an ex- 
ample with explanation in six lines (which may need a little 
amplification). It is one of those things in mathematics where 
if you really see the pinch you can fix the shoe. Try it. 


(22) The adjective “left” should sound a drum here. A 
Hunt process has right-continuous paths, and an optional time 
is approximated from the right, a crucial step in the derivation 
of the strong Markov property from the (simple, unqualified) 
Markov property under certain conditions, including the case 
of Brownian motion (not given in this book but see Note 19). 
But when the last exit is concerned, time must be reversed 
and so “right” becomes “left”. You will see presently that 
approximating D4 from the right will not do. 


(23) When A is the unit ball one can compute js, directly 
from (11.7) and (11.14). I asked Z.Q. Chen to do it. He 
used spherical coordinates and confirmed that it is the uni- 
form distribution over the surface. The details are a bit too 
long to be given here. Sometimes an abstract general argument 
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does better than a special calculation — as if we did not know 
that! 


(24) Whereas the notion of a first happening was utilised 
early in probability arguments, that of a last happening was 
overlooked. As far as I know, “last exit” made its first for- 
mal entry around 1950 in a decomposition formula in Markov 
chains, as dual to “first entrance”; see my elementary book 
([E; Sec. 8.4] and the reminiscent [R]. In continuous time the 
notion is much more subtle and requires careful handling. Of 
course, if the legendary time-arrow is reversed in direction, 
then future becomes past and first becomes last. Easy to 
say but hard to do, rigorously. Physicists talk a lot about 
reversibility and some of the electrical phenomena discussed 
in this book have been interpreted in that light. For example, 
the equilibrium problem has an extension from one conductor 
to two (or more), known as the “condenser problem”. This 
can be treated by probabilistic methods, but we (Chung and 
Getoor, 1977) found that certain “duality hypothesis” was 
needed. We know (see [T]) that any decent Markov process 
can be reversed in time to yield another (slightly weaker) one, 
but unfortunately this innate reversibility has not produced the 
analytic dual structure suitable for classical potential theory, 
see e.g. [L; Chapter 5]. In a sense, the consideration of last exit 
serves as an easier way out of the intractable time-reversal. 
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APPENDICES 


(I) George Green 


INTRODUCTORY OBSERVATIONS. 


THE object of this Essay is to submit to Mathematical Analysis 
the phenomena of the equilibrium of the Electric and Magnetic 
Fluids, and to lay down some general principles equally ap- 
plicable to perfect and imperfect conductors; but, before enter- 
ing upon the calculus, it may not be amiss to give a general 
idea of the method that has enabled us to arrive at results, 
remarkable for their simplicity and generality, which it would 
be very difficult if not impossible to demonstrate in the ordi- 
nary way. 

It is well known, that nearly all the attractive and repul- 
sive forces existing in nature are such, that if we consider any 
material point p, the effect, in a given direction, of all the 
forces acting upon that point, arising from any system of bodies 
S under consideration, will be expressed by a partial differential 
of a certain function of the co-ordinates which serve to define 
the point’s position in space. The consideration of this function 
is of great importance in many inquiries, and probably there are 
none in which its utility is more marked than in those about to 
engage our attention. In the sequel we shall often have occasion 
to speak of this function, and will therefore, for abridgement, call 
it the potential function arising from the system S. Ifp bea 
particle of positive electricity under the influence of forces arising 
from any electrified body, the function in question, as is well 
known, will be obtained by dividing the quantity of electricity 
in each element of the body, by its distance from the particle p, 
and taking the total sum of these quotients for the whole body, 
the quantities of electricity in those elements which are negatively 
electrified, being regarded as negative. 
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(II) L. Bachelier 


PREFACE. 


Cet Ouvrage n’a pas seulement pour but d’exposer, avec 
quelques développements, l'ensemble des connaissances 
acquises dans le passé sur le calcul des probabilités, il a aussi 
pour objet de faire connaitre de nouvelles méthodes et de 
nouveaux résultats qui représentent, 4 certains points de vue, 
une transformation compléte de ce calcul. 

La conception des probabilités continues constitue la base 
de ces nouvelles études. On pensait précédemment que seules 
des formules discontinues pouvaient étre des conséquences 
exactes des principes du calcul des probabilités, et cette idée 
était d’autant plus naturelle que les problémes traités alors ne 
pouvaient admettre d’autres genres de solutions. 

On employait bien parfois des formules continues mais on 
les considérait comme approchées, de sorte qu’elles ne pou- 
vaient servir de base pour de nouvelles recherches. Pour cette 
raison, leur emploi ne s'est pas généralisé depuis Laplace. 

L’idée de considérer les probabilités comme continues fut 
seulement envisagée il y a quelques années lorsque je me pro- 
posai de résoudre des questions ne pouvant admettre que des 
solutions continues exactes. 

La théorie édifiée alors était assez particuliére, des ébauches 
publiées dans différents Recueils indiquent le cours de son 
évolution; généralisée successivement dans tous les sens, elle 
a pris un tel développement qu'il m’a semblé nécessaire de la 
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(III) Wiener 


DIFFERENTIAL-SPACE 


By Norsert WIENER 


§1. Introduction. 

§2. The Brownian Movement. 

§3. Differential-Space. 

§4. The Non-Differentiability Coefficient of a Function. 

§5. The Maximum Gain in Coin-Tossing. 

§6. Measure in Differential-Space. 

§7. Measure and Equal Continuity. 

§8. The Average of a Bounded, Uniformly Continuous Func- 


§9. The Average of an Analytic Functional. 
§10. The Average of a Functional as a Daniell Integral. 
§11. Independent Linear Functionals. 
§12. Fourier Coefficients and the Average of a Functional. 


§1. Introduction. The notion of a function or a curve as an 
element in a space of an infinitude of dimensions is familiar to all 
mathematicians, and has been since the early work of Volterra 
on functions of lines. It is worthy of note, however, that the 
physicist is equally concerned with systems the dimensionality 
of which, if not infinite, is so large that it invites the use of limit- 
processes in which it is treated as infinite. These systems are the 
systems of statistical mechanics, and the fact that we treat their 
dimensionality as infinite is witnessed by our continual employ- 
ment of such asymptotic formulae as that of Stirling or the 
Gaussian probability-distribution. 

The physicist has often occasion to consider quantities which 
are of the nature of functions with arguments ranging over such 
a space of infinitely many dimensions. The density of a gas, or 
one of its velocity-components at a point, considered as depend- 
ing on the codrdinates and velocities of its molecules, are cases 
in point. He therefore is implicitly, if not explicitly, studying the 
theory of functionals. Moreover, he generally replaces any of 
these functionals by some kind of average value, which is essen- 
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(IV) Kakutani 


143. Two-dimensional Brownian Motion and Harmonic 
Functions. 


By Shizuo KAKUTANI. 
Mathematical Institute, Osaka Imperial University. 
(Comm. by T. TAKAGI, aL1A,, Dee, 12, 1944.) 


1. The purpose of this paper is to investigate the properties of 
two-dimensional Brownian motions” and to apply the results thus ob- 
tained to the theory of harmonic functions in the Gaussian plane. Our 
starting point is the following theorem: Let D be a domain in the 
Gaussian plane R®, and let E be a closed set on the boundary Bd(D) of 
D. Then, under certain assumptions on D and E, the probability 
P(t, EB, D), that the Brownian motion starting from a point Ce D will 
entcr into E without entering into the other part Bd(D)—E of the 
boundary of D before it, is equal lo the harmonic measure in the sense 
of R. Nevanlinna of H with respect to the domain D and the point C. 

It is expected that, by means of this method, many of the known 
results in the theory of harmonic or analytic functions will be inter- 
preted from the standpoint of the theory of probability. We shall here 
ive only the fundamental results and a few of its applications, leaving 
the detailed discussions of further applications to another occasion. 

Most of the results obtained in this paper are also valid for the 
ease of higher dimensional Brownian motions. But there are also many 
theorems in which the dimension number plays an essential rdle?. For 
example, Theorems 6,7 and 8 of this paper are no longer true in F’. 
The situation will become clearer if we observe the following theorem : 
Consider the n-dimensional Brownian motion in R"(n = 2), and let K" 
be the closed wnit sphere in R". Then, for any Ze R"—R", the pro- 
bability P(¢, K", R"-K") that the Brownian motion starting from ¢ will 
enter into K” fur some t>0 is equal to |C\** if n23”, while this 
probability is = 1 on R*-—K” if n=2". This result is closely related 
with the fact that there is no bounded harmonic function, other than 
the constant 1, which is defined on R"—- A" and tends to1 as [¢|—1, 
while, for any n= 3, w()=]o[F°* is a non-trivial example of a bounded 
harmonic function with the said property. 

2. Let (z(t, w) = (x(t, w), y(t, w)} |-<o Sh < Oo, we L} be a two- 
dimensional Brownian motion defined on the z= {x, y}-plane R?, i.e. an 
independent system of two one-dimensional Brownian motions {2(t, «) | 


1) Brownian motions were discussed by N. Wiener and Ll. Lévy. Cf. N. Wiener, 
Generalized harmonic analysis, Acta Muth. 54 (1930); N. Wicner, Homogeneous chaos. 
Amer. Journ. of Math, 60 (1939); R.I. A/C. Paley and N. Wiener, Fourier transforms 
in the complex domain, New York, 1983; P. Lévy, L'addition des variables aléatoires, 
Paris, 1987; P. Lévy, Sur certains processus stachastiyues homogénes, Compositio Math., 
7 (1939); P. Lévy, Le mouvement brownien plan, Amer. Journ. of Math., 61 (1940). 

2) Of. R. Nevanlinna, Kindeutige analytische Funktionen, Berlin, 1937. 

3) Gf. S. Kakutani, On Brownian motions in -space, Proc. 20 (1944). 

4) |¢] denotes the Kuclidean distance of ¢ from the origin of Ra, 

5) ‘The case n=2 is contained in Theorem 4 of this paper. 
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SOME THEOREMS CONCERNING BROWNIAN MOTION 


BY 
G. A. HUNT(‘) 


Given a random time T and a Markoff process X(r) with stationary 
transition probabilities, one can define a new process X’(r)=X(r+T). It is 
plausible, if T depends only on the X(r) for r less than 7, that X’(r) is a 
Markoff process with the same transition probabilities as X¥(r) and that, when 
the value of X’(0)=.X(T) is fixed, the process X’(r) is independent of the 
history of X(r) before time T. Although mathematicians use this extended 
Markoff property, at least as a heuristic principle, | have nowhere found it 
discussed with rigor. We begin by proving a precise version for Brownian 
motion (Theorem 2.5,with an extension in §3.3). Our statement has the good 
points that the hypotheses are easy to verify, that the proof is thoroughly 
elementary (it even avoids conditional probabilities), and that it holds for all 
processes with stationary independent increments (see §3.1). I have not 
pushed the scope of the proof to the limit because it requires continuity of the 
sample functions on the right and it is probable that a version of the extended 
Markoff property holds for processes which are only separable. 

In §4 and §5 we use the extended Markoff property to study the tfansi- 
tion probability density g(r, 7, 5) of a Brownian motion in R* in which a par- 
ticle is extinguished the moment it hits a closed set E. lt turns out thar 
g(r, 7, 5), Which can be defined without ambiguity for every r and s in R®, is 
the fundamental solution of the heat equation on R*—E with boundary 
values 0 (at the regular points of E), and that G(r, s)=Jog(r, 7, s)d7 is the 
Green’s function for the Laplacian on R"—E. The content of these sections 
must be considered more or jess as common knowledge; however, | believe 
some of the details are new, especially those concerning irregular points and 
the probabilistic interpretation. 

In writing §4 and §5 I profited from many discussions with Mark Kac and 
Daniel Ray. Had I known of Doob’s fundamental paper [4] on the heat 
equation it is likely that these sections would have been cast in a different 
form. 

Kac has made the following conjecture. Consider in the plane a closed 
bounded domain E and a point 7 outside E; the probability that a Brownian 
motion starting from 7 does not meet E by time 7 is for large r asymptotic 
to 2rH(r)/In r. Here H(r) is the Green’s function of R?’~£ with pole at in- 


Received by the editors Juiy 20, 1955. 
(*) This research was supported in part by the United States Air Force. through the Office 
of Scientific Research of the Air Research and Development Command. 
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(VI) Chung 


PROBABILISTIC APPROACH 
IN POTENTIAL THEORY 
TO THE EQUILIBRIUM PROBLEM 


by Kai Lai CHUNG (') 


The problem of equilibrium is the first problem for the ancient 
period of potential theory recounted by Brelot in his recent histo- 
rical article [1]. The existence of an equilibrium measure for the 
Newtonian potential was affirmed by Gauss as the charge distribution 
on the surface of a conductor which minimizes the electrostatic energy. 
But it was Frostman who rigorously established the existence in his 
noted thesis (1935), and extended it to the case of M. Riesz potentials. 
Somewhat earlier, F. Riesz had given his well-known decomposition 
for a subharmonic function which is a closely related result. For 
further history and references to the literature see Brelot’s article. 
From the viewpoint of probability theory, the equilibrium problem 
in the Newtonian: case takes the following from : 


P*{Ty < 0} = u(x, y) Ug (dy). (1) 
OB 


Here the underlying process is a Brownian motion {X,, ¢ + 0} in 
R? ; P*{---} denotes the probability (Wiener measure) when all paths 


issue from the point x ; B is a compact set (the conductor body) ; 
Ty = T,(w) is the hitting time of B oy the path w : 


T,(w) = inf {rt > 01 X,(w) € B} ; 
OB is the boundary of B ; u(x, y) is the associated potential density 


l 
ah aa Pe AT 


(1) Research supported by the Office of Scientific Research of the United 
States Air Force, under AFOSR Contract F44620-67-C-0049. 
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Generalities 


The triple (W, F, P) is a probability space: W is an ab- 
stract space of points w, F is a Borel tribe of subsets of W 
containing W as a member, P is a probability measure on F’. 

A classic example: W = (0, 1], F = the Borel sets in (0, 1], 
P = the Borel measure. Another example: W = [0,1], F = the 
Lebesgue measurable sets in [0, 1], P = the Lebesgue measure. 
The latter has the property of “completeness” not shared by 
the former. Namely a subset of a set of Borel measure zero may 
not be Borel measurable, but a subset of a set of Lebesgue mea- 
sure zero is always Lebesgue measurable (hence has Lebesgue 
measure zero). It is always possible to “complete” the prob- 
ability space and so we may as well assume this: if A C B, 
BeF and P(B) =0, then Ac F. 

We define a stochastic process on the triple as a collection 
(space) of functions with domain D, parametrized by w in W: 


AX(w,-):t>+X(w,t), tED; 


such that for each t € D, X(-,t) is a random variable on the 
triple, namely a function of w with domain W and measurable 
with respect to the Borel tribe F. In the following we shall 
take the range to be the compactified real line [—oo, +00]. The 
random variable X(.,t) will also be denoted by X;(-) or X¢. 
Elementary knowledge of random variables is presupposed. 
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An older definition of stochastic process is {X;,t € D}. 
In this form it is a collection (family) of random variables 
parametrized by t in D. The two definitions are equivalent 
and serve different ends, as we shall see. 

We have yet to specify the parameter set D. When D is 
the set of natural numbers the second definition of a process 
becomes a sequence of random variables studied in basic proba- 
bility theory. Here we shall concentrate on the case D = (0, 00), 
the positive (sic) real numbers, and t will be called the “time”. 
Thus the stochastic process becomes the chance evolution of a 
random real variable X(-,¢) with the passing of TIME. 

The random variables {X,,0 < s < t} generate a Borel 
tribe to be denoted by F;; as t increases this increases (i.e. does 
not decrease) to Fy. The random variables {X,,t < s < co} 
generate a Borel tribe denoted by FY. 

The process {X;} is called a Homogeneous Markov Process 
iff for each t and each A € F; we have 


P{A|Fi} = P{A|X4} (1) 


where the left member is the conditional probability of A rel- 
ative to F; and the right member the conditional probability 
relative to the tribe generated by the single X;. It is essential 
to understand the exact meaning of the conditioning (which is 
not a common notion in analysis, classic or functional!) When 
the random variable X; takes discrete values x we may write 
P{A|X; = x}; in general this symbolism must be interpreted in 
a Radon—Nikodym derivative sense as we shall do occasionally; 
see Chung [3; Sec. 9.1] for a full discussion. 

When the time parameter t is the natural numbers, the 
homogeneous Markov Property (1) has a facile extension called 
The Strong Markov Property, as follows. Let 7 be a positive 
finite random variable such that for each natural number n we 
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have {w: T(w) =n} € F,. Such a T is called “optional”. Let 
Fr denote the collection of sets A in F’ such that for each n we 
have AN{w:T(w) =n} € F,. Then Fr forms a Borel tribe, a 
subtribe of F’. Let 7 denote the tribe generated by the “post- 
T process {Xp41,t € D}, where X7ry:(w) = X(w,T(w) + 
t). Then the strong Markov property consists in replacing the 
constant ¢ in (1) by the random T everywhere, viz. for each 
A € Fp we have on the set {T < oo}: 


P{AlFr} = P{A|Xp}. (2) 


When the time parameter t is in D = [0,00), the situa- 
tion is more delicate. A positive random variable T' is called 
“optional” iff for each t € D, we have 


{w:T(w)<the k. (3) 


For an optional T we define Fr to be the Borel tribe of sets A 
in F' such that for each t € D we have 


An{w:T(w)<the Fk. 


The assertion that Fy is a Borel tribe is proved by using the 
definition of T. Define the post-T tribe F/, as before. 

A positive constant to such as 17 is a random variable. Is 
it optional? We must check it out against the definition and 
the answer is YES. Next, when T is the constant t, is the tribe 
Fr equal to F,? Again we must check it out, and the answer 
is NO. In fact, when T’ = t we have 


Fr = Fi = [\Fs 
s>t 


which may be larger than F;. This is what I meant by “deli- 
cate”. However by some permissible manipulation, we may 
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suppose that for every t € D, we have Fi4 = F;; see [L; p. 61].* 
Then it follows that the condition (3) for optionality may be 
changed by replacing the “<” there with “<”. All this stuff 
may seem hair-splitting nuisance to a beginner or a practical 
man in a hurry, but is required by relentless logic. Now we are 
ready to state the Strong Markov Property. 

The homogeneous Markov process {X;,t € D} is said to 
have the strong Markov property if and only if (1) remains 
true when the constant t is replaced by an optional random T. 
When T is a constant t, this reduces to (1), as we have seen. 

For continuous time parameter ¢, this stronger property is 
no longer a consequence of the plain Markov property given 
in (1). It must be assumed (as we shall) or proved under addi- 
tional hypothesis on the process. 

Returning to our first definition of process, we call the func- 
tion oft € D: t + X(w,t), or more simply X(w,-), the sample 
function corresponding to w. When its range is specified as 
above, it is a function of the real variable ¢ taking real values 
or the two limiting values +oo and —oo. This is the kind of 
functions studied in calculus. It may be continuous or discon- 
tinuous at each point ¢ and may go to infinity somewhere. For 
a given stochastic process, the nature of its sample functions 
depends on the initial hypotheses on the laws of probability 
governing the movement (evolution) of X; as the time t pro- 
gresses. This is expressed by a transition probability kernel as 
follows. For t in D, x in (—o0, +00), and Borel set A, we put 


P,(a, A) = P?{X; € A} = P{Xe41 € A|X, = 2}, sED. 
(4) 


As a linear operator, P; is defined on the class of bounded Borel 
functions f by 


“See References in Part I. 
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+00 
Pef(a)= f Pale, dy fy) 
with the semigroup property P,,; = P,P;. Observe that there 
is NO random element in the semigroup, and read [GBAP,”* 
Sec. 4] for commentary. 
For the Brownian Motion [Movement] On Line, we have 


_@- =i 


A i ae ay (5) 


where the integrand is the probability density of the normal 
distribution with mean x and variance t; it will be denoted 
as pz(x, y) or p(t; x,y). Read [GBAP, Sec. 2] for historical per- 
spective on this particular case. It is proved that all the sample 
functions of this process may be taken to be everywhere contin- 
uous; and it is a homogeneous Markov process with the strong 
Markov property: a “strong Markov process”. The proofs of 
these fundamental properties of the Brownian Motion may be 
found in [L]. 

We end these preliminaries with a discussion of a primary 
class of optional random variables. For a process taking values 
in R?, the Euclidean space of dimension d > 1, and any Borel 
set there, define 


Ta(w) = inf{t > 0: X(w,t) € A} (6) 


where the inf is +oo when the set is void. We shall call T, 
the “hitting time of A”. If we change the “t > 0” in (6) into 
“t > 0”, the result is a different function of w called the “first 
entrance time in A”. The difference can be tremendous, see 


*Namely Part I in this edition. 
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IGBAP, p. 25] for an anecdote. Now if the time parameter D 
is the natural numbers, it is trivial to verify that T4 is optional. 
When D = (0, co), conditions must be imposed on the process 
and on the set A to make T, optional. [By the way, how do 
we know 74 is even a random variable?| 

We prove that if all sample functions are continuous, and 
A is either an open set or a closed set, then 7'4 is an optional 
random variable. 

Let A be open. Then we have for t > 0, omitting w below: 


{T,4 < t} = {there exists s in (0,t) such that X, € A} 
= {there exists r in [0,t) Q such that X, € A} 


where Q is the set of all rational numbers, and the equation is 
due to the openness of A [it may be false for a closed A!], as 
well as the continuity of X(-). Hence 


{Ta<t}= |) {X, cA}er 
reE[0,t)NQ 


because X, € F;, A is Borel, and F; is a Borel tribe. 
Let B be closed. There exists a sequence of strictly decreas- 
ing open sets O, such that ()7°_, O, = B. Then we have 


= U () {there exists r in [¢/k,t] Q such that X, € O,}. 


It is a good exercise in elementary topology to prove the equa- 
tion above, which is more than needed for the optionality of 
Tp (if we do not assume Fi, = F; as discussed earlier). 

In the case d = 1, namely in the real line, if A is an open 
interval or closed interval, and the process starts outside A, 
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it is geometrically obvious that hitting the bounded interval 
(a, b) or [a,b] is the same as hitting one of the two end points a 
or b; hitting the interval (6, 00) or [b, co) is the same as hitting 
the singleton 6. Thus 74 is reduced to T, A J, or Ty. What 
happens when the process starts inside the interval is obvious 
except when it starts at an end-point. This peculiar situation 
is tricky and will be discussed for the Brownian motion below. 


Introduction 


These lecture notes are based on a short course given at Stan- 
ford University in the fall of 1979.* The audience consists of 
graduate students from several departments and some faculty 
auditors. They are supposed to know the elements of Brownian 
motion including the continuity of paths and the strong Markov 
property, as well as elementary martingale theory. The empha- 
sis is on methodology and some inculcation is intended. 

In the following {X(t),t > 0} is a standard Brownian mo- 
tion on R = (—00, +00). 


1. Exit and Return 


Let (a, b) be a finite interval and put 
T=T a,b) = inf {t>0: X(t) ¢ (a,b)}. (1.1) 


This is called the [first] exit time from (a, 6). It is optional. If 
x ¢ [a,b], then clearly 


p’{r=0}=1 (1.2) 


*Journal of Mathematical Research and Exposition 1 (1) (1981) 73-82 
and 2 (3) (1982) 87-98. 
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by the continuity of paths. If = a or x = b, the matter is 
not so obvious because the inf in (1.1) is taken over ¢ > 0, not 
t > 0. Starting at a, it is conceivable that the path might move 
into (a,b) without leaving it for some time. It is indeed true 
that (1.2) holds for z = a and x = b but the proof will be given 
later. Our immediate concern is: if the path starts at 2, where 
x € (a,b), will it leave the interval eventually? 
We have the crude inequality 


sup P*{r >1}< sup P*{X(1) € (a,b)}. (1.3) 
x€(a,b) xE(a,b) 


The number on the right side can be expressed in terms 
of a normal distribution but it is sufficient to see that it is 
strictly less than one. Denote it by 6. The next step is a basic 
argument using the Markovian character of the process. For 
any x € (a,b) andn > 1: 


P*{r >n} < EX{r >n—1;P*"-VIz > 1} 
< P*{r >n—1}-6. (1.4) 


In the above we have adopted the notation commonly used in 
Markov processes. It symbolizes the argument that if the path 
has not left (a,b) at time n — 1, then wherever it may be at 
that time, the probability does not exceed 6 that it will remain 
in (a,b) for another unit of time. If follows by induction on 
n that 

PP er She Oo, (1.5) 


Since 6 < 1 we obtain P*{r = co} = 0 by letting n > oo. 
This answers our question: the path will (almost surely) leave 
the interval eventually: 


Ptr < 60) =. (1.6) 
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In fact the argument above yields more. For any ¢ such that 
e°d < 1, we have 


[o-@) [e@) 
E*{e"} < So ee P*{n -l<r<n}< > 6" <a, 
n=1 n=1 


(1.7) 


In standard terminology this says that the random variable r 
has a generating function E*{e%"} which is finite for sufficiently 
small values of 6. In particular it has finite moments of all 
orders. 

At the exit time 7, X(T) = a or X(T) = b by continuity. 
For any x € (—00, 00), let us define 


De an {E> 03 XG) Sa}. (1.8) 


This is called the hitting time of the set {x}; it is optional. It 


is clear that 
T(a,b) = Ta A Ty 5 (1.9) 


If we keep 6 fixed while letting a + —oo, then 
im, T(a,b) = Tb. (1.10) 
It is worthwhile to convince oneself that (1.10) is not only in- 


tuitively obvious but logically correct, for a plausible argument 
along the same line might mislead from (1.6) and (1.10) to: 


P*{T, < co} = 1. (1.11) 


The result is true but the argument above is fallacious* and its 
refutation is left as Exercise 1 below. We proceed to a correct 


proof. 


Proposition 1. For every x and b in (—oo, oo), (1.11) is true. 


*It may be reported here that an applied analyst fell for it when asked in 
my class. 
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Proof. For simplicity of notation we suppose x = 0 and 
b = +1. Let a = —1 in (1.7). Since the Brownian motion 
is symmetric with respect to the directions right and left on 
the line, it should be clear that at the exit from (—1,+1), the 
path will be at either endpoint with probability 1/2. If it is at 
+1, then it hits +1. If not, the path is at —1, and we consider 
its further movement until the exit from (—3,+1). According 
to strong Markov property this portion of the path is like a new 
Brownian motion starting at —1, and is stochastically indepen- 
dent of the motion up to the hitting time of —1. Thus we may 
apply (1.6) again with x = —1, a = —3 and b = +1. Observing 
that —1 is the midpoint of (—3,+1), we see that at the exit 
from the latter interval, the path will be at either endpoint with 
probability 1/2. If it is at +1, then it hits +1. If not, the path 
is at —3, and we consider its further movement until the exit 
from (—7,+1), and so on. After n such steps, if the path has 
not yet hit +1, then it will be at —(2”—1) and the next interval 
for exit to be considered is (—(2"*'—1), +1). These successive 
trials (attempts to hit +1) are independent, hence the prob- 
ability that after n trials the path has not hit +1 is equal to 
(5)". Therefore the probability is equal to limno0.($)" = 0 
that the path will never hit +1; in other words the probability 
is 1 that it will hit +1 eventually. 

The scheme described above is exactly the celebrated gam- 
bling strategy called “doubling the ante”. [The origin of the 
name “martingale”.]| The gambler who is betting on the out- 
come of tossing a fair coin begins by staking $1 on the first 
outcome. If he wins he gets $1 and quits. If he loses he flips 
the coin again but doubles the stake to $2. If he wins then 
his net gain is $1 and he quits. If he loses the second time he 
has lost a total of $3. He then repeats the game but redoubles 
the stake to $4, and so on. The mathematical theory above 
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shows that if the game is played in this manner indefinitely, 
the gambler stands to gain $1 sooner or later. Thus it is a 
“sure win” system, the only drawback being that one needs an 
infinite amount of money to play the system. To get a true 
feeling for the situation one should test the gambit by going to 
a roulette table and bet repeatedly on “red or black”, which is 
the nearest thing to a fair game in a casino. 

There are many other proofs of Proposition 1, of which one 
will be given soon in Sec. 2. The proof above is however the 
most satisfying one because it reduces the problem to an old 
legend: if a coin is tossed indefinitely, sooner or later it will be 
heads! 


Exercise 1. What can one conclude by letting a — —oco 
in (1.6) while keeping x and b fixed? Figure out a simple 
Markov process (not Brownian motion) for which (1.6) is true 
but (1.11) is false. 


Exercise 2. Show that (1.11) remains true for x = b, a moot 
point in the proof given above. 
2. Time and Place 


The results in Sec. 1 are fundamental but qualitative. We now 
proceed to obtain quantitative information on the time 7 and 
the place x(r) of the exit from (a,b). Let us define 


(OSH PX) Sale Ply <i 


(2.1) 
RaSh ix) jb SP < Teh 


It is a consequence of (1.7) that 


pa(t)+p(z)=1, Var € (a,b). (2.2) 
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In order to solve for pa(x) and pp(x) we need another equation. 
The neatest way is to observe that the Brownian motion pro- 
cess is a martingale. More pedantically, let F; be the o-field 
generated by {x,,0 < s < t}, then for each x € R: 


{X1, F;, P”} is a martingale. (2.3) 


We leave the verification to the reader. The fundamental mar- 
tingale stopping theorem then asserts that 


{X(t Ar), F(t Ar), P*} is a martingale. (2.4) 
The defining property of a martingale now yields 
E*{X(0)} = E*{X(tAr)}. (2.5) 


The left member above is equal to x; the right member is an 
“incalculable” quantity. Fortunately we can easily calculate its 
limit as t > oo. For almost every sample point w, T(w) < co 
by (1.6) and tA r(w) = r(w) for t > r(w), hence limto. X (tA 
T) = X(r) without even the continuity of X(-). Since 


sup |X(tA7)| < lal V [Bl, (2.6) 
Q<t<0co 


the dominated convergence theorem allows us to take the limit 
as t > oo under the E* in (2.5) to conclude 


a = E*{X(r)}. (2.7) 


The novice must be warned that the verification of domination 
is absolutely de rigueur in such limit-taking, neglect of which 
has littered the field with published and unpublished garbage. 
[On this particular occasion the domination is of course trivial, 
but what if 7 is replaced by Tq for instance?] 


Time and Place 131 
Since X (7) takes only the two values a and 6, (2.7) becomes 
L = apa(x) + bpp(z). (2.8) 
Solving (2.2) and (2.8) we obtain 


Pa(x) = 72 , pele) = — , «€(a,b). (2.9) 


Note that (2.9) is valid only for x € [a,b], and implies 


pa(a) = 1. 
To obtain E*{r} we use another important martingale as- 
sociated with the process: 


{ X(t)? —t, F;, P*} is a martingale. (2.10) 
Application of the stopping theorem gives 
a? = B*{X(r At)? —(r At)}. 


Since X(7 At)? < a2 + b? and E*{r} < 00 (see Sec. 1) we can 
let t + oo and use dominated convergence (how?) to deduce 


a? = E*{X(r)*? —7} = a? pa(x) + b?p,(x) — E*{7}. (2.11) 
Together with (2.9) this yields 
E*{7a,)} = (w@— a)(b— =), (2.12) 


for all x € [a,b]. For z = a we have E*{7(q)} = 0, so 
P*{T(a,5) = O} = 1. 

From the last result it is tempting but fallacious to conclude 
that P°{T, = 0} = 1. In the next exercise we give a correct 
proof of this fact using the symmetry of Brownian motion, 
leaving the details to the reader. 
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Exercise 3. Prove the fundamental result 
Pal, a0 = 15 for alla e R. (2.13) 
We may take x = 0. Define T~ = inf {t > 0: X(t) < 0} and 


Tt = inf {t > 0: X(t) > 0}. From the results above deduce 
P°{T- = 0} =1land P°{T+ = 0} = 1; hence P°{Tp = 0} = 1. 


Exercise 4. Show that starting at any z, the Brownian path 
immediately crosses the z-level infinitely many times. 


Exercise 5. Show that for any x # b we have E*{T,} = +00. 
Exercise 6. Let x, — 2, then P*{T,, — 0} = 1. 


Exercise 7. Let tT = 14,4), t > 0 and v(x) = P*{r > t}. Show 
that y is concave in [a,b] and hence continuous. Let 21, 22, 
z € [a,b] with x = (1 — Ajay + Axe. Let 7! = T(2,,2,). Check 
that pz(%1) = 1— A and p,(%2) = A, so it follows from the 
strong Markov property that 


oy) 2 (1— A)p(#1) + Av(22). 


Exercise 8. Use Exercise 6 and symmetry to show that for 
each t the maximum of P*{7 > t} occurs at = (a + b)/2. 
3. A General Method 


To obtain comprehensive information regarding the joint dis- 
tribution of the time and place of exit from (a, 6), we introduce 
a more powerful martingale. 


Proposition 2. For any real value of the parameter a, 


Ai 
{exp (ax() - =) P,P? is a martingale. (3.1) 
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Proof. We begin with the formula 


E* {exp(az(t))} = exp (az + *) (3.2) 


The left member above being the probabilistic manifestation 
of the analytic formula 


(oe) 
1 (x — y)* 
ay ae 
| e€ oni exp { 7 dy 


its calculation is an exercise in beginner’s calculus. When a = 
10 where i = \/—1 and @ is real, (3.2) should be recognized as 
the familiar Fourier transform or characteristic function of the 
normal distribution commonly denoted by N(x, t). If follows 
that if 0 <s <t then 


To be formal about it this time, let us put 
X(u) = X(s+u) — X(s), for u>0; 


Since the Brownian motion is a process with stationary in- 
dependent increments, the “shifted” process X is a standard 
Brownian motion which is independent of F,. Using (3.2) we 
have 


E* {exp(a(X(t) — X(s)))|Fs} = E°{exp(aX(t — s))} 
a?(t — s) 
nwo (2854). 


which confirms the assertion in (3.1). 
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For a few moments let us denote the martingale in (3.1) by 
M(t). Then for every x € (a, )): 


e®* = B*{M(0)} = E*{M(tAr)}, (3.4) 
where 7 = T(a,p). Since 
[M(t A r)| < exp(lal(lal A ||), 


we obtain by dominated convergence just as in Sec. 2 that 


cr ate} = B {om (20-27) ;x(9)= 0} 


re fow (6-2) :x=r}. 8 


Putting 


2 
(3.6) 
2 
fula) = B* {exp (-") ix(r) =}, 
we have the equation 
er? =e f.(x) +e fy(z), x € (a,b). (3.7) 


We have also the equation 


falc) + fala) = B® {exp (-) \ (38) 


Unlike the situation in Sec. 2 these two equations do not 
yield the three unknowns involved. There are several ways of 
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circumventing the difficulty. One is to uncover a third hidden 
equation — the reader should try to do so before peeking at 
the solution given below. But this quickie method depends 
on a lucky quirk. By contrast, the method developed here, 
though much longer, belongs to the mainstream of probabilistic 
analysis and is of wide applicability. It is especially charming 
in the setting of R?. 

We begin with the observation that if x is the midpoint of 
(a,b) then fa(x) = f,(x) by symmetry so that in this case (3.7) 
is solvable for f.(x). Changing the notation we fix z and con- 
sider the interval (« — h,x +h). We obtain from (3.7) 


eer 1 


ea(z—h) a ea(2+h) = ech 4 e-ah 


fe—n(z) = 


and consequently by (3.8) 


. a? 1 
BE {exp (-F7%-n21m)} = ch(ah) ’ (3.9) 


Here ch denotes the “hyperbolic cosine” function, as sh denotes 
the “hyperbolic sine” function: 


04 9 0 _ p98 


€ € 
eye ee is 1 
ch > shé ; (3.10) 


With this foot in the door, we will push on to calculate f,(zx). 
Recall x € (a,b), hence for sufficiently small h > 0 we have 
[x —h,x +h] C (a,b) and so 


T(e—he+h) < T(a,b) - (3.11) 
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We shall denote 7/2_n,2+n) by T(h) below; observe that it can 
also be defined as follows: 


7(h) = inf {t > 0:|X(t) —X(0)| > A}, (3.12) 


namely the first time that the path has moved a distance >h 
(from whichever initial position). Now starting at z, the path 
upon its exit from (x —h,x +h) will be at « —h or & +h with 
probability 1/2 each. From the instant 7(h) onward, the path 
moves as if it started at these two new positions by the strong 
Markov property, because 7(h) is an optional time. This verbal 
description is made symbolic below. 


= {exp (2) . (3.13) 


Bx foxp (52) x) =a] 


It is crucial to understand why after the random shift 
of time given by 7(h), the “function to be integrated”: 
[exp(—%5*); X(7) = a] does not change. This point is gen- 
erally explained away by a tricky symbolism, but one should 
first perceive the truth with the naked eye. Anyway (3.13) may 
be written as 

4 a*r(h 1 
fo(z) = E {exp (-") } gtfa(a —h)+fa(at+h)}. 
(3.14) 
Using (3.9) we may rewrite this as follows: 


fa(z +h) — 2fa(z)+ fa(e—h)  2ch(ah) — 2 
h2 7 h? 


fa(x). (3.15) 


A General Method 137 


Letting h | 0 we see that the left member in (3.15) converges 
to a* f,(x). It is also immediate from (3.14) that 


fal) <slfale—A)+ fale +h)}, (3:16) 


valid fora <x-h<a2x+h< b. Since f, is also bounded, 
(3.16) implies f, is continuous in (a,b), in fact convex (see 
e.g.{Courant, Differential and Integral Calculus Vol. II, p. 326]). 
Now if fa has a second derivative f”” in (a,b), then an easy 
exercise in calculus shows that the limit as h | 0 of the left 
member in (3.15) is equal to f?’(x). What is less easy is to show 
that a close converse is also true. This is known as Schwarz’s 
theorem on generalized second derivative, a basic lemma in 
Fourier series. We state it in the form needed below. 


Schwarz’s Theorem. Let f be continuous in (a,b) and sup- 
pose that 


him f(t +h) ~ 2f(w) + Fle —h) _ 
h-0 h2 


v(x) Va € (a,b) 


where ~ is continuous in (a,b). Then f is twice differentiable 
and f" = » in (a,b). 

It is sufficient to prove this theorem when ¢y = 0 in (a,}), 
and a proof can be found in Titchmarsh, Theory of Func- 
tions, 2nd ed., p. 431. Since f, has been shown to be conti- 
nuous, Schwarz’s theorem applied to (15) yields the differential 
equation: 


fil(z) = a? f(z), x € (a,b). (3.17) 


The most general solution of this equation is given by 


fa(x) = Ae** + Be™™, (3.18) 
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where A and B are two arbitrary constants. To determine 
these we compute the limits of f,(z) as c a and x —> b, 
from inside (a,b). From (3.6) and (2.9) we infer that 


a b— 
lim fa(z) < lim E*{X(r) = a} = lim 5 — =0; 
a ~ ' (3.19) 
nr t—-a 

j i re 4 = fj a 
eye es ee 


Since f, > 0, the first relation above shows that lim,_,» 


fa(x) = 0, using (3.7) we see that 


ex < exe lim f(x) 
a—b 


so limz-45 fo(x) = 1. Similarly we have 
jim fale) = Ly lim fia = 0. (3.20) 
Thus we obtain from (3.18): 
0= Ae + Be, 1= Ae** + Be. 
Solving for A and B and substituting into (3.18), we obtain 


fle) = fale) = oe (3.21) 


where the second formula in (3.21) may be obtained from the 
first by interchanging a and b. Finally we have by (3.8): 


r= {ap (222) } = BAe=AethOle=9, ayy 


Exercise 9. The quick way to obtain (3.21) is to use (3.7) for 
—a as well as +a. 
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Exercise 10. Derive (2.9) from (3.21), and compute 
E*{T,;T, < Tp}. 
Answer: (b — x)(x — a)(2b — a — x)/3(b— a). 
Exercise 11. Show that for 0 < 6 < 17/2(b — a)”, we have 


cos (v2 (x — a5)) | 


E® {97 (a,b)} = 
cos (28 (252) ) 
Prove that E*{e97(%)} = +00 for 6 = 1?/2(b — a)?. 

A third way to derive (3.21) will now be shown. Since (3.7) 
is valid for —a@ as well as a, letting a 4 —oo we obtain 


a? 
elelt — elolb pe {exp (n)} : 


Changing a?/2 into A, writing y for b and observing that the 
result is valid for any x # y on account of symmetry of the 
Brownian motion with respect to right and left: 


Et{eTv} = exp(—V 2a — y)). (3.23) 


This equation holds also when x = y, but the argument above 
does not include this case. The little sticking point returns to 
haunt us! We can dispose of it as follows. If y > x then T, — 
T, almost surely (proof?), hence the Laplace transform of Ty 
converges to that of T,,, and (3.23) takes on the limiting form 


E*{e7} = 1. 


It follows that P*{T, = 0} = 1, namely Exercise 3 again. 
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Recalling that (44) = Ta \ Tb, we can write down the 
following relations: 


Br fet} 
= E*{exp(—A(T2 A Tb)); Ts < Ta} 
+ E*{exp(—A{Ta A Th});Ta < Te} - E*{e7*7?} 
Et{e Te} 
= E*{exp(\(Ta A Ts));Ta < To} 
+ E*{exp(—{Ta A Tp}); Ts < Ta} - E’f{e7*7"} . 


Using (3.23) we see that these two equations can be solved 
for the two unknowns which are just fa(x) and fp(z) after a 
change of notation. 


(3.24) 


4. Drift 


The methods above can be used to obtain analogous results 
for a Brownian motion with a constant drift, namely for the 
process: 

X(t)= X(t)+et (4.1) 
where X(t) is the standard Brownian motion and c is a nonzero 
constant. We may suppose c > 0 for definiteness. 

The strong law of large numbers implies that almost surely 


jim X(t) = +00. (4.2) 


The argument in Sec. 1 is still valid to show that exit from 
any given interval (a,b) is almost sure, but the analogue to 
Proposition 1 must be false. The reader should find out for 
himself that the martingales in (2.3) and (2.10) translated in 
terms of X , are not sufficient to determine 


Ba(x) = P*{X(7) =a}, plz) = P*{X(F) =}, (4.3) 
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where 


T= Ts = nk {t>0: X(t) ¢ (a, b)}. 


Fortunately, the martingale in (3.1) can be manipulated to 
do so. 
Take a = —2c in (3.1). We have 


(2c)t 
ea 


exp (-20x( ) =exp(—2cX(t)). (4.4) 


Write s(a) = e~?°7, then 
{s(X(t)), F;,P7} is a martingale. (4.5) 


It follows that 


s(x) = s(a)pa(x) + 5(b)po(z) (4.6) 
together with 
Pa(z) + po(z) = 1, (4.7) 
we obtain 
solo) = =) yy _ st) = 3a) 
Pa(@) =e s(b) = s(a) ’ Do ) s(b) — s(a) - (4.8) 


The function s is called the scale function. Compare (1.8) 
with (1.9). We can now use the martingale X(t) — ct to find 
E*{7}. As before, the stopping theorem yields 


a = E*{X(7) — c7} = apa(x) + bpp(x) — cE™ {7}, 


and so 


aise: ao) eae) ala 2G) Say) 
ae o(s(b) — (a)) | 
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More interesting is to use (4.8) to obtain information for 
the hitting time 


Ty = inf {¢>0: X(t) =y}. 


If we let b + oo in the first and a + —oo in the second equation 
in (4.8), the results are as follows: 
P®{T, <co} =e "9, a<a<oo; 
2 (4.9) 
P*{T, <cof}=1, -co<a<b. 
The second relation is of course an immediate consequence 
of (4.2), and the continuity of the paths. 
To obtain the distribution of T,, we return to the martin- 
gale in (3.1) translated in terms of X(t); 


exp (ak ~ (ac + 5) ) 


2 


a 
A= — 
al 


a= -ctV2rA+Cc?. 


we obtain in the usual manner 


Put 


e%? = e® E*{exp(—AT); X(7) = a} 
+ e* E* fexp(—A7); X(F) = b}. (4.11) 


Choose the + sign in a so that a > 0, and let a + —oo. Then 
choose the — sign in a so that a < 0, and let b > +00. The 
results may be recorded as follows: 


E*{exp(—ATy)} = exp(—/ 2A + e|a — y| — (x — y)). (4.12) 
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Using (3.24) we can obtain the joint distribution of X(r) 
and 7. In general the results are complicated but one in- 
teresting case emerges when x = 0, b > 0 anda = —b. 
In this case if we let f(A) = E°(exp(—A7); X(7) = 6) and 
f(A) = E%(exp(—A*); X() = —b), then (3.24) becomes 


exp(—/2 + cb + be) 

= f(A) + f-(A) exp(—V/2A + c2 (2b) + 2be) , 
exp(—/2 + 2b — be) 

= f_(A) + f(A) exp(—V/2A + €2(2b) + 2bc) . 


(4.13) 


Dividing each equation by its left hand side and subtracting, 
we obtain 


f(A) (exp(/2A + c2b — be) — exp(—/2A + c2b — be)) 
= f_(A)(exp(/2A — 2b + be) — exp(—/2A + 2b + be)), 
and consequently 
fe) = 2 f_(d). (4.14) 
Since we have also from (4.8) 
P°(X(#) = b) = e?©° P°(X (7) = -b), (4.15) 
it follows that 


E°(exp(—A7)|X(#) = b) = E°(exp(A7)|X (7) = —b). 
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That is, the exit time 7 and the exit place X(7) are indepen- 
dent. This curious fact was first observed by Frederick Stein.* 
Is there an intuitive explanation? 


Exercise 12. Almost every sample function X(-,w) has a min- 
imum value m(w) > —oo. Use the strong Markov property to 
show that m has an exponential distribution, and then find 
this distribution. 


Exercise 13. Show that almost every path of X reaches its 
minimum value m(w) only once. 


Exercise 14. This exercise, which is based on a result of 
J. W. Pitman and J. C. Rogers, shows that sometimes pro- 
cesses which are “obviously” not Markovian actually are. Let 
X* and X~ be independent Brownian motions with drifts +c 
and —c respectively and let € be an independent random vari- 
able which = +1 with probability p and = —1 with probability 
1 —p. Construct a process Y by letting Y, = X; on {€ = 1} 
and Y; = X; on {€ = —1}. The claim is that Y is a Markov 
process with respect to G:, the o-field generated by Y;, s < t. 

At first glance this seems false because watching Y; gives 
us information about € which can be used to predict the future 
development of the process. This is true but a little more 
thought shows 


PE =11G:) =e7' /(e** +e), 


Verify this by (4.9) and show that Y; is Markovian. [I owe this 
exercise to R. Durrett] 


*“An independence in Brownian motion with constant drift”, Ann. of 
Prob. 5 (1977), 571-572. 


Dirichlet and Poisson Problems 145 


5. Dirichlet and Poisson Problems 


In classical potential theory (see Kellogg [1]) there are a clutch 
of famous problems which had their origins in electromag- 
netism. We begin by stating two of these problems in Euclidean 
space R¢, where d is the dimension. Let D be a nonempty 
bounded open set (called a “domain” when it is connected), 
and let 0D denote its boundary: 0D = DM (D¢) where the 
upper bar denotes closure. Let A denote the Laplacian, namely 


the differential operator 


a. bg 


A function defined in D is called harmonic there iff Af = 0 
in D. This of course requires that f is twice differentiable. 
If f is locally integrable in D, namely has a finite Lebesgue 
integral over any compact subset of D, then it is harmonic 
in D if and only if the following “surface averaging property” 
is true. Let B(x,5) denote the closed ball with center x and 
radius 6. For each z € D and 6 > 0 such that B(x, 6) C D, we 


have 


(=e | swe) 62) 


8B(2,6) . 


where o(dy) is the area measure on 0B(z, 6). This alternative 
characterization of harmonic function is known as Gauss’s the- 
orem and plays a basic role in probabilistic potential theory, 
because probability reasoning integrates better than differen- 
tiates. 
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Dirichlet’s problem (or first boundary value problem). 
Given D and a continuous function f on OD, to find a function 
y which is continuous in D and satisfies: 


Ap=0 in D, 
(5.3) 
yp=f on OD. 


Poisson’s problem. Given D and a continuous function f in 
D, to find a function y which is continuous in D and satisfies 


Ag=f in D, 
(5.4) 
y=0 on OD. 


We have stated these problems in the original forms, of 
which there are well-known generalizations. As stated, a 
unique solution to either problem exists provided that the 
boundary OD is not too irregular. Since we shall treat only 
the one-dimensional case we need not be concerned with the 
general difficulties. 

In R}, a domain is just an bounded open nonempty interval 
I = (a,b). Its boundary OJ consists of the two points {a, b}. 
Since Af = f”, a harmonic function is just a linear function. 
The boundary function f reduces to two arbitrary values as- 
signed to the points a and b, and no question of its continuity 
arises. Thus in R! Dirichlet’s problem reads as follows. 


Problem 1. Given two arbitrary numbers f(a) and f(b), to 
find a function y which is linear in (a,b) and continuous in 
[a, b], such that y(a) = f(a), y(b) = f(b). 

This is a (junior) high school problem of analytic geometry. 
The solution is given by 


b-2 xr—-a 
A ns weer 


f(b). (5.5) 
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Now we will write down the probabilistic solution, as follows 
y(x) = E*{f(X(r))}, x € (a,b) (5.6) 


where T = 7a). If we evaluate the right member of (5.6) 
by (2.9), we see at once that it is the same as given in (5.5). 
But we will prove that ¢ is the sought solution by the general 
method developed in Sec. 3, because the same pattern of proof 
works in any dimension. Using the 7(h) of (3.12), we obtain 


pla) = BEX F(X(r))}} = {ole —h) + ole +h)} 
(5.7) 


for any h for which (3.11) is true. This is the one-dimensional 
case of Gauss’s criterion for harmonicity. Since y is bounded 
it follows from the criterion that y is harmonic, namely linear. 
But we can also invoke Schwarz’s Theorem in Sec. 3 to deduce 
this result, indeed the generalized second derivative of y is 
identically zero by (5.7) 

It remains to show that as x — a or b from inside (a,b), 
v(x) tends to f(a) or f(b) respectively. This is a consequence 
of the probabilistic relations below: 


lim P*{r=Ta}=1, lim P*{r =Ty} = 1 (5.8) 


which are immediate by (2.9). But since no such analogue is 
available in dimension > 1, another proof more in the general 
spirit is indicated in Exercise 15 below. Assuming (5.8), we 
have 


g(x) = E*{f(X(Ta))37 = Ta} + BP {f(X(Th)); 7 = To} 
= P*{r =Ta}f(a) + P*{r = To} f(b), 
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and consequently 
lim g(z) = 1- f(a) +0- (6) = F(a); 
lim (2) =0- f(a) +1- f(b) =f). 
Thus the extension of y to [a,b] agrees with f at a and b. 


[Since y is linear in (a, b), it has a trivial continuous extension 
to [a,b]. This no longer trivial in dimension > 1.] 


Exercise 15. Show that for any « > 0, 
i ag < aie le 
iim PE {To SNS e} 1 


This is equivalent to lim, 49 P°{T_, < e} = 1, and is a case of 
Exercise 6. Now derive (5.8) from (5.9). 


Problem 2. Given a bounded continuous function f in (a,b), 
to find a function y which is continuous in [a,b] such that 


g'(x)=—f(z), for x € (a,b), (5.10) 


The constants 5 and —1 in the differential equation are 
chosen for the sake of convenience, as will become apparent 
below. This is a simple calculus problem which can be solved 
by setting 


x 


p(x) = [rw — x)f(y)dy+cx+d 


a 
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and determining the constants d = —ca and 


b 
c= (b- a) f (b— yw) fly)dy 


by the boundary conditions y(a) = 0 and y(b) = 0. Substitut- 
ing these values for c and d and rearranging we can write the 
solution above as 


b 


p(x) = [oemsoay (5.11) 
where 
sear ee Peery 
HEY) = o4—2)(y—a) (5.12) 


‘ ifa<y<a<b. 


Note that g(x,y) > 0 in (a,b) and g(z,y) = g(y,x). We put 
g(x,y) = 0 outside (a,b) x (a,b). The function g is known as 
the Green’s function for (a,b) because representing the solu- 
tion of (5.10) in the form (5.11) is an example of the classical 
method of solving differential equations by Green’s functions 
(see Courant and Hilbert [2] Ch. V. 14 and Exercises 17 and 
18 below). 

Now we will write down the probabilistic solution of Prob- 
lem 2, as follows: 


ple) = E* | / joy (5.13) 


Note that the integral above may be regarded as over (0,7) so 
that f need be defined in (a,b) only. Without loss of generality 
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we may suppose f > 0; for the general case will follow from this 
case and f = ft —f~. To show that ¢ satisfies the differential 
equation, we proceed by the method of Sec. 3, we have 


rh) or 
oe) =E4 | f+ f | sexwpe 


OQ  r(h) 


T(h) 
=F i f(X(t))dt 


+E {evo | / jx _ (5.14) 
0 


Let us put 


then 
pla) =V(x,h) + s{ole +h) +ele—A)}. (6.16) 


Since f > 0, Y 2 0; also y(z) < ||f\|B*{7} < |IFII(6- )?/4. 
Thus y is continuous and concave. Now write (5.16) as 


g(a +h) —2p(2) + o(a@~-h) _ _ 2(z,h) 
h2 h2 : 


To calculate the limit of the right member of (5.17) as h > 0, 
we note by (2.12): 


(5.17) 


E*{r(h)} =h?. (5.18) 
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Next we have 
T(h) 


Veh) — Fe@)Er(}= EP) f [F(X() — F(X(O)lae 


(5.19) 


Since f is continuous at x, given e > 0 there exists ho(e) such 
that if |y — x| < ho(e) then |f(y) — f(x)| < «. Hence if 0 < 
h < ho, we have |f(X(t)) — f(X(0))| < e for 0 <t<7(h) and 
so the absolute value of the right member of (5.19) is bounded 
by E*{er(h)} = ch?. It follows that the left member of (5.19) 
divided by h? converges to zero as h — 0, and consequently 
by (5.18) 


= f(z). (5.20) 


Since y is continuous by concavity from (5.16), and f is con- 
tinuous by hypothesis, an application of Schwarz’s Theorem 
yields the desired result 


gp" (x) = —2f (2). 


lim v(z, h) S y) 


h->0 


Furthermore since 


Ie(@)| SFI}, 


p(x) converges to zero as x > a or x + b by (2.12). On the 
other hand y(a) = y(b) = 0 by (2.13). Thus ¢ is continuous 
in [a,b] and vanishes at the endpoints. 

If we equate the two solutions of Problem 2 given in (5.12) 
and (5.13), we obtain 


b 


E* {sexo = J oensedy (5.21) 


a 
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for every bounded continuous f on (a,b). Let us put for z € R! 
and B € B!: 


V (a, B) = E* / Lp(X(t))at (5.22) 


0 


Then it follows from (5.21) and F. Riesz’s theorem on the rep- 
resentation of linear functionals on (a, b) as measures (see, e.g., 
Royden [4, p. 310]) that we have 


Vie B\= i g(t, u)dy. (5.23) 
B 


In other words, V(z,-) has g(z,-) as its Radon—Nikodym 
derivative with respect to the Lebesgue measure on (a,b). The 
kernel V is sometimes called the potential of the Brownian mo- 
tion killed at 7. It is an important object for the study of this 
process since V(x, B) gives the expected occupation time of B 
starting form z. 


Exercise 16. Show by using elementary calculus that the so- 
lutions to Problems 1 and 2 in R! are unique. 


Exercise 17. Define a function g(z, y) in [a, }] as follows. For 
each x let gz(-) = 9(z,-). 


(i) ga is a continuous function with g,(a) = gz(b) = 0; 


(ii) for allz#y, g2(y) =0, 
(iii) lim.—o(g,(x + €) — 9,(% — €)) = —1. 


Show that the function g(z,y) defined by (5.12) is the only 
function with these properties. 
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Example 18. Let the function 6, have the defining property 
that for any function f on (a,b) we have 


6b 
[oeorau = sy). (5.23) 


This 6, is called the Dirac delta function [never mind its 
existence!]. It follows that 


[bulw)du = Noo(y)- (5.24) 


We can now solve the differential equation 
h” = —26,, h(a) = h(b) =0. 


by another integration of (5.24). Carry this out to obtain 
h(x) = g(x,y), which is what results if we let f = —2d, 
in (5.21). 


Exercise 19. Determine the measure H(z,-) on OF so that the 
solution to Problem 1 may be written as 


/ f(y) H(z, dy). 
or 


The analogue in R? is called the harmonic measure for J. It 
is known in the classical theory that this measure may be ob- 
tained by taking the “interior normal derivative” of g(x, y) with 
respect to y. Find out what this means in R?. 


Exercise 20. Give meaning to the inverse relations: 


A A 
S(-G)=1, (-@QZ=1 
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where I i the identity, and G is the operator defined by 
Gf(x) = J. 9(t,y)f (y)dy. 


Exercise 21. Solve the following problem which is a combina- 
tion of Problems 1 and 2. Given fg on OJ and continuous fy 
in I, find y such that ¢ is continuous in J and satisfies 

1 


ge Shi ind; 


yp=fe2 ondl. 


6. Feynman—Kac Functional 


As a final application of the general method, we will treat a 
fairly new problem. Reversing the previous order of discussion, 
let us consider 
ole) = BY exp f a(X()at- F(X) fp» FE [a,b] (6.1) 
0 


where g is a bounded continuous function in [a,b], f as in 
Problem 1 (6.1). Note that by Exercise 3: 


y(a)= fla), — p() = F (0). 


The exponential factor in (6.1) is called the Feynman—Kac 
functional. 

An immediate question is whether ¢ is finite. If q =a 
constant c, and f = 1, then y = oo for sufficiently large c, by 
Exercise 11. 

Let us write e(u) = f) q(X(t))dt for u > 0. 


Proposition 3. Suppose f > 0 in (6.1). If yp =o in (a,b), 
then ~ is continuous in [a, b]. 
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Proof. Let (xo) < oo, and x © zo, x € (a,b). Then we have 
by the strong Markov property 


CO > v(xo) > E*°{e(r); Tr < 7} = E*°{e(T,.); Te < T}y(zx). 


Since P*°{T, <7} >0 and e(T;,) > 0, this implies v(x) < oo. 
Next, given any A > 0, there exists 6 > 0 such that we have 


E™{e4T®)} < oo, (6.3) 


This follows from the derivation of (1.7). Consequently we have 
by dominated convergence 


: xf pAr(h)y — 
jim E*{e }=1, (6.4) 
We now state a lemma. 


Lemma. Let ¢ be a finite nonnegative function on [a,b] hav- 
ing the following approximate convexity property. For each 
[v1, £2] C [a,b], 0 < 22-21 < d(e) and x = Az, + (1— A)ao, 
0<A<1, then 


(1 — €){Ay(21) + (1 — A) p(z2)} 
< p(x) < (1+ €){Ap(z1) + (1 — A)y(22)}. (6.5) 


Such ap is continuous in [a, 6]. 


Proof of the lemma. Let ¢ = 1, h < 6(1) and A = §, then 
we have by (6.5) 


o(x) < {p(e@+h)+p(e—h)}. (6.6) 


For a suitable h we can divide [a,b] into a finite number of 
subintervals of length 2h each. If we apply (6.5) to each subin- 
terval we see that ¢ is bounded. 
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Now fix x in (a,b) and shrink [21,22] to x in such a way 


that A > 1 and y(z1) > lim ¢y(y) or (x1) > limg(y), we 
yite ytte 
obtain from (6.5): 


d= e) lim p(y) < (x) < (1+) lim y(y). 
vite ytte 


Similarly for y || x. Since ¢« is arbitrary this shows that ¢ is 
continuous at x. A similar argument shows that ¢ is unilater- 
ally continuous at a and at b. Lemma is proved. 

We return to y and generalize the basic argument in Sec. 3 
by considering the first exit time from an asymmetric interval 
(cx —h,x +h’) C (a,b), starting from x. Recall that 


h' 


Pre %. < Tet = hth . 


(6.7) 


For sufficiently small h and h’ we have by (6.4): 
l-e< EB {e-@"}, Eee" } clte (6.8) 


where Q = |[q|| and 7* = t(g_p 2th’). The strong Markov 
property yields, for the y in (6.1) with an arbitrary f: 


/ 


ola) = B*{e(r")} {aa ele) + paola +h) 
(6.9) 


Using (6.8) and (6.9) we see that if f > 0 in (6.1) then 
satisfies the conditions of the Lemma, and is therefore conti- 
nuous in [a, }]. In particular this is true when f = 1,,} or 1,4). 
Hence it is also true for the y in (6.1) for an arbitrary finite f. 
Proposition 3 is proved. 

Let us write Yq and y» for the y in (6.1) when f = Iya} 
and f = 1,5; respectively. According to Proposition 3, either 
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Ya = 00 OF Y, is bounded continuous in |a, b], and similarly for 
yp. However, it seems possible that ya = co but vy, co in 
[a, b], or vice versa. For a general f, we have 


o(x) = f(a)pa(x) + f(b) pole), (6.10) 


provided the right member above is not +00 — 00 or —0c0 +. 
This is certainly the case under the hypothesis of the next 
proposition. 


Proposition 4 Suppose that ya * co and yy *¥ co in (a,d). 
Then for any f > 0 we have 


1 is 
5” taqp=0 
in (a,b), and y is continuous in [a, b]. 
Proof. Write 
E*{e(r(h))} =1+ Y(z,h); (6.11) 
then (6.9) for kh = h’ takes the form: 
x+h)—2y(2)+y9(x—-h w(a,h 
vies Nola) tele —A) MEM era ple). 
(6.12) 
Since we have proved that y is continuous, the quantity 


in (6.12) will converge to — limp_,o[¢(z, h)/h?]2p(x) as h > 0, 
provided that the latter limit exists. To show this we need 


E*{r(h)?} = a (6.13) 
also that for sufficiently small h we have by (6.4) 


Beeson 9. (6.14) 
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Exercise 22. Prove (6.13). Can you get a general formula for 
E*{r(h)*}, k > 1? 
Using the trivial inequality /u < e“ for all 0 <u < oo, we 


have 
r(h)e97(h) 2 7(h)3/2¢e(@+d)r(h) ‘ 


Hence by Hélder’s inequality (6.13) and (6.14) 


E® {r(h)2e@7()} < E®{r(h)P2 Pr Eer{eetyr(a t/a < ch? , 
(6.15) 


where c, is a constant. Next we use the inequality 


2 
U 
|e" — 1 —u| < ern 


valid for all u, to obtain 
T(h) 


E® 2 le(r(h)) —1— / g(X(t))dt 


2 


E jf ax@pe ear (h) 


< 2 Bre (hee, 


The last term divided by h? converges to zero as h — 0, 
by (6.15). Hence by (5.20) with f replaced by gq: 


h-0 h2 


i r(h) 
lim a = lim Ey. i axa = q(x). 


Therefore Schwarz’s theorem applied to (6.12) yields 
y’’ = ~—2q as asserted. Note that the continuity of y is required 
here also. Proposition 4 is proved. 
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Propositions 3 and 4 together give a dichotomic criterion 
for the solvability of the following problem. 


Problem 3. Given a bounded continuous function gq in (a, b) 
and two arbitrary number f(a) and f(b), to find a function y 
which is continuous in [a,b] such that 


50" (2) +a(x)ola)=0, 2 € (a,b); 


p(a)= fla), (6) = f(®). 


(6.16) 


Exercise 23. Is the solution to Problem 3 unique when it 
exists? 


Exercise 24. Solve the problem similar to Problem 3 but with 
the right side of the differential equation in (6.16) replaced by a 
given bounded continuous function in (a, b). This is the Poisson 
problem with the Feynman—Kac functional. 


Exercise 25. Prove that if the equation in (6.16) has a positive 
solution ¢ in (a,b), then for any [c,d] C (a,b), we have 


g(x) = E*{e(7c,a) ) P(X (7,4) }, 2 € [ed]. 


In particular, 
xz — E*{e(t.,a)} 


is bounded in |[c, d]. 
Exercise 26. Prove that if the differential equation in (6.16) 


has a positive solution in each interval (c,d) such that [c, d] C 
(a,b) (without any condition on the boundary {c,d}) then it 
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has a positive solution in (a,b). These solutions are a prior 
unrelated to one another. 


Exercise 27. Is it possible that y~, = oo in (a,b) whereas 
(p X= CO in (a,b)? Here Yq and y, are defined before Proposi- 
tion 4. This is a very interesting problem solved by M. Hogan, 
a graduate student in my class. 
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1. Introduction™ 


Let X = {X(t),t > 0} be astrong Markov process with contin- 
uous paths on R = (—oo, +00). Such a process is often called 
a diffusion. For each real b, we define the hitting time 7) as 
follows: 


7} = inf{t > 0| X(t) = b}. 


Let P, and E, denote as usual the basic probability and ex- 
pectation associated with paths starting from a. It is assumed 
that for every a and b, we have 


Pfi< 00} = 1. (1) 


Now let g be a bounded Borel measurable function on R, and 
write for brevity 


t 


e(t) = exp [axes ; (2) 


0 


——  — 
Séminaire de probabilités, XIV (1978/9), Strasbourg, France. 
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This is a multiplicative functional introduced by R. Feynman 
and M. Kac. In this paper we study the quantity 


u(a, b) = Eq{e(ts)} : (4) 


Since q is bounded below, (2) implies that u(a, b) > 0 for every 
a and b, but it may be equal to +oo. A fundamental property 
of u is given by 


u(a, b)u(b, c) = u(a,c) , (5) 
valid fora<b<c,ora>b>c. This is a consequence of the 
strong Markov property (SMP). 

2. The Results 


We begin by defining two abscissas of finiteness, one for each 
direction. 


6 =inf{b € R| Ja < b: u(a,b) = co} 
= sup{b € R|Va < b: u(a,b) < oo}; 

a =sup{a € R| 4b >a: u(b,a) = co} oi 
= inf{a € R|Vb >a: u(b,a) < oo}. 


It is possible, e.g., that G = —oo or +00. The first case occurs 
when X is the standard Brownian motion, and q(x) = 1; for 
then, u(a,b) > Ea(7») = 00, for any a £ b. 


Lemma 1. We have 
(@ = inf{b € R| Va < b: u(a,b) = co} 
= sup{b € R| da <b: u(a,b) < co}; 
a =sup{a € R|Vb >a: u(b,a) = oo} 
= inf{a € R|4b> a: u(b,a) < oo}. 
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Proof. It is sufficient to prove the first equation above for £, 
because the second is trivially equivalent to it, and the equa- 
tions for a follow by similar arguments. Suppose u(a, b) = 00; 
then for x < a < b we have u(z,b) = oo by (5). Fora<xu<b 
we have by SMP, 


u(x, b) > E,{e(Ta); Ta < T}u(a, b) = co 


since P,{T. < 74} > 0 in consequence of (2). 

The next lemma is a martingale argument. Let J; be the 
o-field generated by {X,,0 < s < t} and all null sets, so that 
Sit =U: for t > 0; and for any optional 7 let J, and J,+ and 
3,- have the usual meanings. 


Lemma 2. Ifa <b< £, then 
lim u(a, 7) eam he (7) 


lim u(a, byt. (8) 


Proof. Let a < b, t b and consider 
Ea{e(7)|3(7,)}, n>. (9) 


Since 6 < 6, u(a,b) < oo and the sequence in (9) forms a 
martingale. As nt 00, 7, t 7 a.s. and 3(7,,) tT 3(7-). Since 
e(7») € 3(7%-), the limit of the martingale is a.s. equal to 
e(t»). On the other hand, the conditional probability in (9) is 
also equal to 


Tb 


Bde / HX | IGS Se ale 


Than 
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As n ¢ oo, this must then converge to e(7)) a.s.; since e(T», ) 
converges to e(7») a.s., we conclude that u(bn,b) > 1. This 
establishes (7). 

Now let 6 > b> a, | a, and consider 


Eu{e(t)|3(ta,)}5 n>1. (10) 


This is again a martingale. Although a — T, is a.s. left con- 
tinuous, not right continuous, for each fixed a we do have 
Ta, + Ta and 3(Ta,) | 3(ta). Hence we obtain as before 
u(an, b) > u(a, b) and consequently 


u(a, b) 
u(an, b) 


ulara,,) = 


This establishes (8). 
The next result illustrates the basic probabilistic method. 


Theorem 1.* The following three propositions are equivalent: 
(i) B = +00; 
(ii) a = —00; 
(iii) For every a and b, we have 


u(a, b)u(b,a) <1. (11) 


Proof. Suppose z(0) = b and let a < b <c. If (i) is true then 
u(b,c) < oo for every c > b. Define a sequence of successive 
hitting times T,, as follows (where 6 denotes the usual shift 
operator): 


oS TA f a fi ee 


Tan = Tan—1 + T° OT, ,_; 5 Ton+1 = Ton + S 0 O7,, ; 
(12) 


*This odd result looks like a case of time-reversibility. Nobody has been 
able to authenticate this surmise. (March 2001). 
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for n > 1. Define also 
N = min{n > 0|Ton41 = Te}. (13) 
It follows from P,{t- < oo} = 1 that 0 < N < was. For 
n > 0, we have 


on Tre41 


Byfe(r.)N =n} = Bs exp [> f a(X(s))ds 
k=0 Ty 


= Ey{e(ta)3 Ta < Te}” 


x Eafe(t)}”Ep{e(Te); Te < Ta} - 
(14) 


Since the sum of the first term in (14) over n > 0 is equal to 
u(b,c) < oo, the sum of the last term in (14) must converge. 
Thus we have 


Ex{e(Ta); Ta < Te}u(a,b) <1. (15) 


Letting c — oo we obtain (14). Hence u(b,a) < oo for every 
a < b and so (ii) is true. Exactly the same argument shows 
that (ii) implies (iii) and so also (i). 

We are indebted to R. Durrett for ridding the next lemma 
of a superfluous condition. 


Lemma 3. Given anya € R and Q > 0, there exists an 
€ = €(a,Q) such that 


Eg{e??*} < 00 (16) 


where 


o-. = inf{t > 0|X(t) g (a-—¢,a+e)}. 
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Proof. Since X is strong Markov and has continuous paths, 
there is no “stable” point. This implies P,{o. > 1} > 0 as 
€ — 0 and so there exists € such that 


Pidgeo ice (ey, (17) 


Now og; is a terminal time, so x + P,{o- > 1} is an excessive 
function for the process X killed at o,. Hence by standard the- 
ory it is finely continuous. By for a diffusion under hypothesis 
(2) it is clear that fine topology coincides with the Euclidean. 
Thus z > P,{o- > 1} is in fact continuous. It now follows 
that we have, further decreasing e¢ if necessary: 

sup P,{oe > 1} <e (@t), (18) 


|jn—al<e 
A familiar inductive argument then yields for all n > 1. 
Pirlo, > n} < e M+) (19) 


and (16) follows. 


Lemma 4. For any a < £ we have 
u(a, 8) = 00; (20) 


for any b > a we have u(b, a) = oo. 


Proof. We will prove that if u(a,b) < oo, then there exists 
c > b such that u(b,c) < oo. This implies (20) by Lemma 1, 
and the second assertion is proved similarly. 
Let Q = ||q||. Given b we choose a and d so that a<b<d 
and 
Ey{e@78)}) < 00, (21) 


This is possible by Lemma 3. Now let b < c < d; then asc | b 
we have 
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Ep{elta); Ta < Te} < Ey{e@ tera), Ta < Tey >0 (22) 


because P,{Ta < T-} — 0. Hence there exists c such that 


Pyle talita < Te} < _— (23) 


(a,b) | 
This is just (15) above, and so reversing the argument there, 
we conclude that the sum of the first term in (14) over n > 0 
must converge. Thus u(b, c) < 00, as was to be shown. 
To sum up: 


Theorem 2. The function (a,b) > u(a, b) is continuous in the 
region a <b < B and in the regiona <b <a. Furthermore, 
extended continuity holds ina<b< 6 anda<b<a, except 
at (8,2) when B < ov, and at (a,a) when a > —oo. 


Proof. To see that there is continuity in the extended sense at 
(a, 2), where a < , let a < b, + B. Then we have by Fatou’s 
lemma 


lim u(a, b,) > a4 lim e(to)} 
noo n—0o 
= E.{e(7g)} = u(a, B) = 00. 
If B < ov, then u(@, G) = 1 by definition, but u(a, B) = oo for 
all a < (; hence u is not continuous at (3,3). The case for a 
is similar. 


3. The Connections 


Now let X be the standard Brownian motion on R and gq be 
bounded and continuous on R. 
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Theorem 3. Suppose that u(x,b) < co for some, hence all, 
x<b. Then u(-,b) is a solution of the Schrédinger equation: 


1 
a +qyp=0 
in (—oo, b) satisfying the boundary condition 
lim p(x) = 1. 
x—b 


There are several proofs of this result. The simplest and 
latest proof was found a few days ago while I was teaching a 
course on Brownian motion. This uses nothing but the theorem 
by H. A. Schwarz on generalized second derivative and the con- 
tinuity of u(-,b) proved in Theorem 2. It will be included in a 
projected set of lecture notes.* An older proof due to Varadhan 
and using Ito’s calculus and martingales will be published else- 
where. An even older unpublished proof used Kac’s method 
of Laplace transforms of which an incorrect version (lack of 
domination! see Part I, p. 11) had been communicated to me 
by an ancien collégue. 

But none of these proofs will be given here partly because 
they constitute excellent exercises for the reader, and partly 
because the results have recently been established in any di- 
mension (for a bounded open domain in lieu of (—oo, b)). 

These are in the process of consolidation and extension. 
(See Part I, References [F].) 

I am indebted to Pierre van Moerbeke for suggesting the 
investigation in this note. The situation described in Theo- 
rem 1 for the case of Brownian motion apparently means the 
absence of “bound states” in physics! 


*Part II. 


Green, Brown, and 
Probability & Brownian 
Motion on the Line 


This book consists of two parts. 
Part 1 constitutes the second 
edition of the author's widely 
acclaimed book Green, Brown, and 
Probability (1995). In this exposition, 
the author presents from a 
historical perspective the beautiful 
relations between the Brownian 
motion process in probability 
theory and several fundamental 
propositions in mathematical 
physics associated with Newton, 
Coulomb, Laplace, Gauss, Green, 
and Feynman. 


Part 2 of the book comprises 
lecture notes based on a short 
course on "Brownian Motion on 
the Line“ which the author gave at 
Stanford University. The metho- 
dology of probabilistic thinking is 
emphasized to illustrate the grand 
themes expounded in Part | in the 
relatively simple case of one- 
dimensional space. Numerous 
exercises are retained for possible 
classroom instruction. 
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